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Preface 


This book is an introductory graduate-level textbook on the theory of 
smooth manifolds, for students who already have a solid acquaintance with 
general topology, the fundamental group, and covering spaces, as well as 
basic undergraduate linear algebra and real analysis. It is a natural sequel 
to my earlier book on topological manifolds [Lee00]. 

This subject is often called “differential geometry.” I have mostly avoided 
this term, however, because it applies more properly to the study of smooth 
manifolds endowed with some extra structure, such as a Riemannian met- 
ric, a symplectic structure, a Lie group structure, or a foliation, and of the 
properties that are invariant under maps that preserve the structure. Al- 
though I do treat all of these subjects in this book, they are treated more as 
interesting examples to which to apply the general theory than as objects 
of study in their own right. A student who finishes this book should be 
well prepared to go on to study any of these specialized subjects in much 
greater depth. 

The book is organized roughly as follows. Chapters 1 through 4 are 
mainly definitions. It is the bane of this subject that there are so many 
definitions that must be piled on top of one another before anything in- 
teresting can be said, much less proved. I have tried, nonetheless, to bring 
in significant applications as early and as often as possible. The first one 
comes at the end of Chapter 4, where I show how to generalize the classical 
theory of line integrals to manifolds. 

The next three chapters, 5 through 7, present the first of four major 
foundational theorems on which all of smooth manifolds theory rests—the 
inverse function theorem—and some applications of it: to submanifold the- 
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ory, embeddings of smooth manifolds into Euclidean spaces, approximation 
of continuous maps by smooth ones, and quotients of manifolds by group 
actions. 

The next four chapters, 8 through 11, focus on tensors and tensor fields 
on manifolds, and progress from Riemannian metrics through differential 
forms, integration, and Stokes’s theorem (the second of the four founda- 
tional theorems), culminating in the de Rham theorem, which relates dif 
ferential forms on a smooth manifold to its topology via its singular coho- 
mology groups. The proof of the de Rham theorem I give is an adaptation 
of the beautiful and elementary argument discovered in 1962 by Glen E. 
Bredon [Bre93]. 

The last group of four chapters, 12 through 15, explores the circle of 
ideas surrounding integral curves and flows of vector fields, which are the 
smooth-manifold version of systems of ordinary differential equations. I 
prove a basic version of the existence, uniqueness, and smoothness theo- 
rem for ordinary differential equations in Chapter 12, and use that to prove 
the fundamental theorem on flows, the third foundational theorem. After 
a technical excursion into the theory of Lie derivatives, flows are applied 
to study foliations and the Frobenius theorem (the last of the four founda- 
tional theorems), and to explore the relationship between Lie groups and 
Lie algebras. 

The Appendix (which most readers should read first, or at least skim) 
contains a very cursory summary of prerequisite material on linear algebra 
and calculus that is used throughout the book. One large piece of prereq- 
uisite material that should probably be in the Appendix, but is not yet, 
is asummary of general topology, including the theory of the fundamental 
group and covering spaces. If you need a review of that, you will have to 
look at another book. (Of course, I recommend [Lee00], but there are many 
other texts that will serve at least as well!) 

This is still a work in progress, and there are bound to be errors and 
omissions. Thus you will have to be particularly alert for typos and other 
mistakes. Please let me know as soon as possible when you find any errors, 
unclear descriptions, or questionable statements. Ill post corrections on 
the Web for anything that is wrong or misleading. 

I apologize in advance for the dearth of illustrations. I plan eventually 
to include copious drawings in the book, but I have not yet had time to 
generate them. Any instructor teaching from this book should be sure to 
draw all the relevant pictures in class, and any student studying from them 
should make an effort to draw pictures whenever possible. 


Acknowledgments. There are many people who have contributed to the de- 
velopment of this book in indispensable ways. I would like to mention es- 
pecially Judith Arms and Tom Duchamp, both of whom generously shared 
their own notes and ideas about teaching this subject; Jim Isenberg and 
Steve Mitchell, who had the courage to teach from these notes while they 
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were still in development, and who have provided spectacularly helpful 
suggestions for improvement; and Gary Sandine, who after having found 
an early version of these notes on the Web has read them with incredible 
thoroughness and has made more suggestions than anyone else for improv- 
ing them, and has even contributed several first-rate illustrations, with a 
promise of more to come. 

Happy reading! 


John M. Lee Seattle 
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1 
Smooth Manifolds 


This book is about smooth manifolds. In the simplest terms, these are 
spaces that locally look like some Euclidean space R”, and on which one 
can do calculus. The most familiar examples, aside from Euclidean spaces 
themselves, are smooth plane curves such as circles and parabolas, and 
smooth surfaces R? such as spheres, tori, paraboloids, ellipsoids, and hy- 
perboloids. Higher-dimensional examples include the set of unit vectors in 
IR”*! (the n-sphere) and graphs of smooth maps between Euclidean spaces. 

You are probably already familiar with manifolds as examples of topo- 
logical spaces: A topological manifold is a topological space with certain 
properties that encode what we mean when we say that it “locally looks 
like” R”. Such spaces are studied intensively by topologists. 

However, many (perhaps most) important applications of manifolds in- 
volve calculus. For example, the application of manifold theory to geometry 
involves the study of such properties as volume and curvature. Typically, 
volumes are computed by integration, and curvatures are computed by for- 
mulas involving second derivatives, so to extend these ideas to manifolds 
would require some means of making sense of differentiation and integration 
on a manifold. The application of manifold theory to classical mechanics in- 
volves solving systems of ordinary differential equations on manifolds, and 
the application to general relativity (the theory of gravitation) involves 
solving a system of partial differential equations. 

The first requirement for transferring the ideas of calculus to manifolds is 
some notion of “smoothness.” For the simple examples of manifolds we de- 
scribed above, all subsets of Euclidean spaces, it is fairly easy to describe 
the meaning of smoothness on an intuitive level. For example, we might 
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want to call a curve “smooth” if it has a tangent line that varies continu- 
ously from point to point, and similarly a “smooth surface” should be one 
that has a tangent plane that varies continuously from point to point. But 
for more sophisticated applications, it is an undue restriction to require 
smooth manifolds to be subsets of some ambient Euclidean space. The am- 
bient coordinates and the vector space structure of R” are superfluous data 
that often have nothing to do with the problem at hand. It is a tremen- 
dous advantage to be able to work with manifolds as abstract topological 
spaces, without the excess baggage of such an ambient space. For exam- 
ple, in the application of manifold theory to general relativity, spacetime 
is thought of as a 4-dimensional smooth manifold that carries a certain 
geometric structure, called a Lorentz metric, whose curvature results in 
gravitational phenomena. In such a model, there is no physical meaning 
that can be assigned to any higher-dimensional ambient space in which the 
manifold lives, and including such a space in the model would complicate 
it needlessly. For such reasons, we need to think of smooth manifolds as 
abstract topological spaces, not necessarily as subsets of larger spaces. 

As we will see shortly, there is no way to define a purely topological 
property that would serve as a criterion for “smoothness,” so topological 
manifolds will not suffice for our purposes. As a consequence, we will think 
of a smooth manifold as a set with two layers of structure: first a topology, 
then a smooth structure. 

In the first section of this chapter, we describe the first of these structures. 
A topological manifold is a topological space with three special properties 
that express the notion of being locally like Euclidean space. These prop- 
erties are shared by Euclidean spaces and by all of the familiar geometric 
objects that look locally like Euclidean spaces, such as curves and surfaces. 

In the second section, we introduce an additional structure, called a 
smooth structure, that can be added to a topological manifold to enable us 
to make sense of derivatives. At the end of that section, we indicate how 
the two-stage construction can be combined into a single step. 

Following the basic definitions, we introduce a number of examples of 
manifolds, so you can have something concrete in mind as you read the 
general theory. (Most of the really interesting examples of manifolds will 
have to wait until Chapter 5, however.) We then discuss in some detail how 
local coordinates can be used to identify parts of smooth manifolds locally 
with parts of Euclidean spaces. At the end of the chapter, we introduce 
an important generalization of smooth manifolds, called manifolds with 
boundary. 
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This section is devoted to a brief overview of the definition and properties 
of topological manifolds. We assume the reader is familiar with the basic 
properties of topological spaces, at the level of [Lee00] or [Mun75], for 
example. 

Suppose M is a topological space. We say M is a topological manifold of 
dimension n or a topological n-manifold if it has the following properties: 


e M isa Hausdorff space: For every pair of points p,q € M, there are 
disjoint open subsets U,V C M such that pe U andqe V. 


e M is second countable: There exists a countable basis for the topology 
of M. 


e M is locally Euclidean of dimension n: Every point has a neighbor- 
hood that is homeomorphic to an open subset of R”. 


The locally Euclidean property means that for each p € M, we can find 
the following: 


e an open set U C M containing p; 
e an open set Uc R”; and 


e a homeomorphism y: U — U (i.e, a continuous bijective map with 
continuous inverse). 


Exercise 1.1. Show that equivalent definitions of locally Euclidean spaces 
are obtained if, instead of requiring U to be homeomorphic to an open subset 
of R”, we require it to be homeomorphic to an open ball in R”, or to R” 
itself. 


The basic example of a topological n-manifold is, of course, R”. It is 
Hausdorff because it is a metric space, and it is second countable because 
the set of all open balls with rational centers and rational radii is a count- 
able basis. 

Requiring that manifolds share these properties helps to ensure that 
manifolds behave in the ways we expect from our experience with Euclidean 
spaces. For example, it is easy to verify that in a Hausdorff space, one- 
point sets are closed and limits of convergent sequences are unique. The 
motivation for second countability is a bit less evident, but it will have 
important consequences throughout the book, beginning with the existence 
of partitions of unity in Chapter 2. 

In practice, both the Hausdorff and second countability properties are 
usually easy to check, especially for spaces that are built out of other man- 
ifolds, because both properties are inherited by subspaces and products, as 
the following exercises show. 
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Exercise 1.2. Show that any topological subspace of a Hausdorff space is 
Hausdorff, and any finite product of Hausdorff spaces is Hausdorff. 


Exercise 1.3. Show that any topological subspace of a second countable 
space is second countable, and any finite product of second countable spaces 
is second countable. 


In particular, it follows easily from these two exercises that any open 
subset of a topological n-manifold is itself a topological n-manifold (with 
the subspace topology, of course). 

One of the most important properties of second countable spaces is ex- 
pressed the following lemma, whose proof can be found in [Lee00, Lemma 
2.15}. 


Lemma 1.1. Let M be a second countable topological space. Then every 
open cover of M has a countable subcover. 


The way we have defined topological manifolds, the empty set is a topo- 
logical n-manifold for every n. For the most part, we will ignore this special 
case (sometimes without remembering to say so). But because it is useful 
in certain contexts to allow the empty manifold, we have chosen not to 
exclude it from the definition. 

We should note that some authors choose to omit the the Hausdorff 
property or second countability or both from the definition of manifolds. 
However, most of the interesting results about manifolds do in fact require 
these properties, and it is exceedingly rare to encounter a space “in nature” 
that would be a manifold except for the failure of one or the other of these 
hypotheses. See Problems 1-1 and 1-2 for a couple of examples. 


Coordinate Charts 


Let M be a topological n-manifold. A coordinate chart (or just a chart) on 
M is a pair (U,y), where U is an open subset of M and y: U > U isa 
homeomorphism from U to an open subset U = y(U) C R” (Figure 1.1). 
If in addition U is an open ball in R”, then U is called a coordinate ball. 

The definition of a topological manifold implies that each point p € M is 
contained in the domain of some chart (U, y). If y(p) = 0, we say the chart 
is centered at p. Given p and any chart (U,y) whose domain contains p, 
it is easy to obtain a new chart centered at p by subtracting the constant 
vector y(p). 

Given a chart (U,y), we call the set U a coordinate domain, or a co- 
ordinate neighborhood of each of its points. The map y is called a (local) 
coordinate map, and the component functions of y are called local coordi- 
nates on U. We will sometimes write things like “(U, y) is a chart containing 
p” as a shorthand for “(U,y) is a chart whose domain U contains p.” 
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FIGURE 1.1. A coordinate chart. 


We conclude this section with a brief look at some examples of topological 
manifolds. 


Example 1.2 (Spheres). Let S" denote the (unit) n-sphere, which is 
the set of unit-length vectors in R"*?: 


S" = {2 € Rt: |a| = 1}. 


It is Hausdorff and second countable because it is a subspace of R”. To 
show that it is locally Euclidean, for each index 7 = 1,...,n +1, let US 
denote the subset of S” where the ith coordinate is positive: 


OF SA aca! VES oa S00). 


Similarly, U; is the set where x’ < 0. 


7 


For each such 2, define maps y;: U; 


— R” by 


Cre at Sate) 


9 


where the hat over 2’ indicates that x’ is omitted. Each y; is evidently a 
continuous map, being the restriction to S” of a linear map on R”+!. It is 
a homeomorphism onto its image, the unit ball B” C R”, because it has a 
continuous inverse given by 


(o>) iS (ut, eeu" ty/1 — Jul?,u’,... ,u") : 
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Since every point in S"*? is in the domain of one of these 2n + 2 charts, S” 
is locally Euclidean of dimension n and is thus a topological n-manifold. 


Example 1.3 (Projective Spaces). The n-dimensional real projective 
space, denoted by P” (or sometimes RP”), is defined as the set of 1- 
dimensional linear subspaces of R"*!. We give it the quotient topology 
determined by the natural map 7: R"*! \ {0} — P” sending each point 
x € R"*+\ {0} to the line through z and 0. For any point « € R"*! \ {0}, 
let [x] = (x) denote the equivalence class of x in P”. 

For each i = 1,...,n +1, let U; C R"+! \ {0} be the set where x’ £0, 
and let U; = n(U;) C P”. Since U; is a saturated open set (meaning that it 
contains the full inverse image 1~!(z(p)) for each p € U;), Uj is open and 


TT: U; — U; is a quotient map. Define a map y;: U; — R” by 


x! vl a 
yilx’, Lise orl — ( 


wot 


This map is well-defined because its value is unchanged by multiplying x 
by a nonzero constant, and it is continuous because y; 0 7 is continuous. 
(The characteristic property of a quotient map 7 is that a map f from 
the quotient space is continuous if and only if the composition f o 7 is 
continuous; see [Lee00].) In fact, y; is a homeomorphism, because its inverse 
is given by 


-1 


DO aarti) i sce Cy alex tse |e 


as you can easily check. Geometrically, if we identify R” in the obvious way 
with the affine subspace where x’ = 1, then y;[z] can be interpreted as the 
point where the line [1] intersects this subspace. Because the sets U; cover 
P”, this shows that P” is locally Euclidean of dimension n. The Hausdorff 
and second countability properties are left as exercises. 


Exercise 1.4. Show that P” is Hausdorff and second countable, and is 
therefore a topological n-manifold. 


Smooth Structures 


The definition of manifolds that we gave in the preceding section is suffi- 
cient for studying topological properties of manifolds, such as compactness, 
connectedness, simple connectedness, and the problem of classifying man- 
ifolds up to homeomorphism. However, in the entire theory of topological 
manifolds, there is no mention of calculus. There is a good reason for this: 
Whatever sense we might try to make of derivatives of functions or curves 
on a manifold, they cannot be invariant under homeomorphisms. For ex- 
ample, if f is a function on the circle S', we would want to consider f to 
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be differentiable if it has an ordinary derivative with respect to the an- 
gle @. But the circle is homeomorphic to the unit square, and because of 
the corners the homeomorphism and its inverse cannot simultaneously be 
differentiable. Thus, depending on the homeomorphism we choose, there 
will either be functions on the circle whose composition with the homeo- 
morphism is not differentiable on the square, or vice versa. (Although this 
claim may seem plausible, it is probably not obvious at this point how to 
prove it. After we have developed some more machinery, you will be asked 
to prove it in Problem 5-11.) 

To make sense of derivatives of functions, curves, or maps, we will need 
to introduce a new kind of manifold called a “smooth manifold.” (Through- 
out this book, we will use the word “smooth” to mean C'®, or infinitely 
differentiable. ) 

From the example above, it is clear that we cannot define a smooth 
manifold simply to be a topological manifold with some special property, 
because the property of “smoothness” (whatever that might be) cannot be 
invariant under homeomorphisms. 

Instead, we are going to define a smooth manifold as one with some 
extra structure in addition to its topology, which will allow us to decide 
which functions on the manifold are smooth. To see what this additional 
structure might look like, consider an arbitrary topological n-manifold M. 
Each point in M is in the domain of a coordinate map y: U — U C R”. 
A plausible definition of a smooth function on M would be to say that 
f: M — Ris smooth if and only if the composite function foy~': U = R 
is smooth. But this will make sense only if this property is independent 
of the choice of coordinate chart. To guarantee this, we will restrict our 
attention to “smooth charts.” Since smoothness is not a homeomorphism- 
invariant property, the way to do this is to consider the collection of all 
smooth charts as a new kind of structure on M. In the remainder of this 
chapter, we will carry out the details. 

Our study of smooth manifolds will be based on the calculus of maps 
between Euclidean spaces. If U and V are open subsets of Euclidean spaces 
R” and R™, respectively, a map F': U — V is said to be smooth if each 
of the component functions of F has continuous partial derivatives of all 
orders. If in addition F is bijective and has a smooth inverse map, it is called 
a diffeomorphism. A diffeomorphism is, in particular, a homeomorphism. A 
review of some of the most important properties of smooth maps is given 
in the Appendix. 

Let M be a topological n-manifold. If (U,y), (V,w) are two charts such 
that UNV # @, then the composite map 70 y7!: p(UNV) = W(UNV) 
(called the transition map from y to w) is a composition of homeomor- 
phisms, and is therefore itself a homeomorphism (Figure 1.2). Two charts 
(U, vy) and (V,w) are said to be smoothly compatible if either UNV = & 
or the transition map wo ~~! is a diffeomorphism. (Since g(U NV) and 
w(U NV) are open subsets of R”, smoothness of this map is to be inter- 
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FIGURE 1.2. A transition map. 


preted in the ordinary sense of having continuous partial derivatives of all 
orders.) 

We define an atlas for M to be a collection of charts whose domains cover 
M. An atlas A is called a smooth atlas if any two charts in A are smoothly 
compatible with each other. 

In practice, to show that the charts of an atlas are smoothly compatible, 
it suffices to check that the transition map ~ oy! is smooth for every 
pair of coordinate maps y and wv, for then reversing the roles of y and w 
shows that the inverse map (70 y~!)~! = pow! is also smooth, so each 
transition map is in fact a diffeomorphism. We will use this observation 
without further comment in what follows. 

Our plan is to define a “smooth structure” on M by giving a smooth atlas, 
and to define a function f: M — R to be smooth if and only if fo y~! is 
smooth (in the ordinary sense of functions defined on open subsets of R”) 
for each coordinate chart (U,) in the atlas. There is one minor technical 
problem with this approach: In general, there will be many possible choices 
of atlas that give the “same” smooth structure, in that they all determine 
the same collection of smooth functions on M. For example, consider the 
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following pair of atlases on R”: 


A, = {(R”,Id)} 
Ao = {(Bi(x), Id): « € R"}, 


where B,(x) is the unit ball around x and Id is the identity map. Although 
these are different smooth atlases, clearly they determine the same collec- 
tion of smooth functions on the manifold R” (namely, those functions that 
are smooth in the sense of ordinary calculus). 

We could choose to define a smooth structure as an equivalence class of 
smooth atlases under an appropriate equivalence relation. However, it is 
more straightforward to make the following definition. A smooth atlas A 
on M is maximal if it is not contained in any strictly larger smooth atlas. 
This just means every chart that is smoothly compatible with every chart 
in A is already in A. (Such a smooth atlas is also said to be complete.) 

Now we can define the main concept of this chapter. A smooth structure 
on a topological n-manifold M is a maximal smooth atlas. A smooth mani- 
fold is a pair (M, A), where M is a topological manifold and A is a smooth 
structure on M7. When the smooth structure is understood, we usually omit 
mention of it and just say “M is asmooth manifold.” Smooth structures are 
also called differentiable structures or C'° structures by some authors. We 
will use the term smooth manifold structure to mean a manifold topology 
together with a smooth structure. 

We emphasize that a smooth structure is an additional piece of data 
that must be added to a topological manifold before we are entitled to talk 
about a “smooth manifold.” In fact, a given topological manifold may have 
many different smooth structures (we will return to this issue in the next 
chapter). And it should be noted that it is not always possible to find any 
smooth structure—there exist topological manifolds that admit no smooth 
structures at all. 

It is worth mentioning that the notion of smooth structure can be gen- 
eralized in several different ways by changing the compatibility require- 
ment for charts. For example, if we replace the requirement that charts be 
smoothly compatible by the weaker requirement that each transition map 
wo! (and its inverse) be of class C*, we obtain the definition of a C* 
structure. Similarly, if we require that each transition map be real-analytic 
(i.e., expressible as a convergent power series in a neighborhood of each 
point), we obtain the definition of a real-analytic structure, also called a 
C’ structure. If M has even dimension n = 2m, we can identify R?” with 
C™ and require that the transition maps be complex analytic; this deter- 
mines a complex analytic structure. A manifold endowed with one of these 
structures is called a C* manifold, real-analytic manifold, or complex man- 
ifold, respectively. (Note that a C° manifold is just a topological manifold.) 
We will not treat any of these other kinds of manifolds in this book, but 
they play important roles in analysis, so it is useful to know the definitions. 
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Without further qualification, every manifold mentioned in this book 
will be assumed to be a smooth manifold endowed with a specific smooth 
structure. In particular examples, the smooth structure will usually be 
obvious from the context. If M is a smooth manifold, any chart contained 
in the given maximal smooth atlas will be called a smooth chart, and the 
corresponding coordinate map will be called a smooth coordinate map. 

It is generally not very convenient to define a smooth structure by ex- 
plicitly describing a maximal smooth atlas, because such an atlas contains 
very many charts. Fortunately, we need only specify some smooth atlas, as 
the next lemma shows. 


Lemma 1.4. Let M be a topological manifold. 


(a) Every smooth atlas for M is contained in a unique maximal smooth 
atlas. 


(b) Two smooth atlases for M determine the same maximal smooth atlas 
if and only if their union ts a smooth atlas. 


Proof. Let A be a smooth atlas for M, and let A denote the set of all charts 
that are smoothly compatible with every chart in A. To show that A is a 
smooth atlas, we need to show that any two charts of A are compatible with 
each other, which is to say that for any (U,y), (V,W) € A, woy!: o(UN 
V) = W(U NV) is smooth. 

Let « = y(p) € yp(UNV) be arbitrary. Because the domains of the charts 
in A cover M, there is some chart (W, 6) € A such that p € W. Since every 
chart in A is smoothly compatible with (W, 6), both the maps 00 ~~! and 
wo6-! are smooth where they are defined. Since p€ UNV NW, it follows 
that yoy t = (Wo—!)o(@0y7") is smooth on a neighborhood of x. Thus 
wo + is smooth in a neighborhood of each point in (UNV). Therefore 
A is a smooth atlas. To check that it is maximal, just note that any chart 
that is smoothly compatible with every chart in A must in particular be 
smoothly compatible with every chart in A, so it is already in A. This 
proves the existence of a maximal smooth atlas containing A. If B is any 
other maximal smooth atlas containing A, each of its charts is smoothly 
compatible with each chart in A, so B C A. By maximality of B, B = A. 

The proof of (b) is left as an exercise. 


Exercise 1.5. Prove Lemma 1.4(b). 


For example, if a topological manifold M can be covered by a single 
chart, the smooth compatibility condition is trivially satisfied, so any such 
chart automatically determines a smooth structure on M. 
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Before proceeding further with the general theory, let us establish some 
examples of smooth manifolds. 


Example 1.5 (Euclidean spaces). R” is a smooth n-manifold with the 
smooth structure determined by the atlas consisting of the single chart 
(R”, Id). We call this the standard smooth structure, and the resulting co- 
ordinate map standard coordinates. Unless we explicitly specify otherwise, 
we will always use this smooth structure on R”. 


Example 1.6 (Finite-dimensional vector spaces). Let V be any 
finite-dimensional vector space. Any norm on V determines a topology, 
which is independent of the choice of norm (Exercise A.21 in the Appen- 
dix). With this topology, V has a natural smooth structure defined as fol- 
lows. Any (ordered) basis (£1,...,£,) for V defines a linear isomorphism 
E: R" = V by 


This map is a homeomorphism, so the atlas consisting of the single chart 
(V,E~+) defines a smooth structure. To see that this smooth structure 
is independent of the choice of basis, let (£1,...,£,) be any other basis 
and let E(x) = >i, x) E; be the corresponding isomorphism. There is some 
invertible matrix (A?) such that E; = > j Al E; for each j. The transition 
map between the two charts is then given by E~! o E(x) = %, where 
% = (x1,...,") is determined by 


n n n 
aig Fae a oe t Ad 
SBE; =) aE; = 5 x Ai EB; 

j= t=1 aj=l 


It follows that 7 = 3%, A?a’. Thus the map from z to & is an invertible 
linear map and hence a diffeomorphism, so the two charts are smoothly 
compatible. This shows that the union of the two charts determined by 
any two bases is still a smooth atlas, and thus all bases determine the same 
smooth structure. We will call this the standard smooth structure on V. 


The Einstein Summation Convention 


This is a good place to pause and introduce an important notational con- 
vention that we will use throughout the book. Because of the proliferation 
of summations such as }>, 2’; in this subject, we will often abbreviate 
such a sum by omitting the summation sign, as in 
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We interpret any such expression according to the following rule, called 
the Einstein summation convention: If the same index name (such as 7 in 
the expression above) appears twice in any term, once as an upper index 
and once as a lower index, that term is understood to be summed over 
all possible values of that index, generally from 1 to the dimension of the 
space in question. This simple idea was introduced by Einstein to reduce 
the complexity of the expressions arising in the study of smooth manifolds 
by eliminating the necessity of explicitly writing summation signs. 

Another important aspect of the summation convention is the positions 
of the indices. We will always write basis vectors (such as E;) with lower 
indices, and components of a vector with respect to a basis (such as x’) with 
upper indices. These index conventions help to ensure that, in summations 
that make mathematical sense, any index to be summed over will typically 
appear twice in any given term, once as a lower index and once as an upper 
index. 

To be consistent with our convention of writing components of vectors 
with upper indices, we need to use upper indices for the coordinates of 
a point (#t,...,2”) € R”, and we will do so throughout this book. Al- 
though this may seem awkward at first, in combination with the summa- 
tion convention it offers enormous advantages when working with compli- 
cated indexed sums, not the least of which is that expressions that are not 
mathematically meaningful often identify themselves quickly by violating 
the index convention. (The main exceptions are the Euclidean dot product 
eG = xy’, in which i appears twice as an upper index, and certain 
expressions involving matrices. We will always explicitly write summation 
signs in such expressions. ) 


More Examples 


Now we continue with our examples of smooth manifolds. 


Example 1.7 (Matrices). Let M(m x n,R) denote the space of m x n 
matrices with real entries. It is a vector space of dimension mn under ma- 
trix addition and scalar multiplication. Thus M(m x n, R) is a smooth mn- 
dimensional manifold. Similarly, the space M(m x n,C) of m x n complex 
matrices is a vector space of dimension 2mn over R, and thus a manifold 
of dimension 2mn. In the special case m = n (square matrices), we will 
abbreviate M(n x n,R) and M(n x n,C) by M(n,R) and M(n, C), respec- 
tively. 


Example 1.8 (Open Submanifolds). Let U be any open subset of R”. 
Then U is a topological n-manifold, and the single chart (U,Id) defines a 
smooth structure on U. 
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More generally, let MZ be a smooth n-manifold and U C M any open 
subset. Define an atlas on U by 


Ay = {smooth charts (V,y) for M such that V Cc U}. 


It is easy to verify that this is a smooth atlas for U. Thus any open subset 
of a smooth n-manifold is itself a smooth n-manifold in a natural way. We 
call such a subset an open submanifold of M. 


Example 1.9 (The General Linear Group). The general linear 
group GL(n,R) is the set of invertible n x n matrices with real entries. 
It is an n?-dimensional manifold because it is an open subset of the n?- 
dimensional vector space M(n,R), namely the set where the (continuous) 
determinant function is nonzero. 


Example 1.10 (Matrices of Maximal Rank). The previous example 
has a natural generalization to rectangular matrices of maximal rank. Sup- 
pose m <n, and let M,,(m x n,R) denote the subset of M(m x n,R) 
consisting of matrices of rank m. If A is an arbitrary such matrix, the fact 
that rank A = m means that A has some nonsingular m x m minor. By 
continuity of the determinant function, this same minor has nonzero de- 
terminant on some neighborhood of A in M(m x n,R), which implies that 
A has a neighborhood contained in M,,(m x n,R). Thus M,,(m x n,R) is 
an open subset of M(m x n,R), and therefore is itself an mn-dimensional 
manifold. A similar argument shows that M,,(m x n,R) is an mn-manifold 
when n < m. 


Exercise 1.6. If k is an integer between 0 and min(m,n), show that the 
set of m x n matrices whose rank is at least k is an open submanifold of 
M(m x n,R). 


Example 1.11 (Spheres). We showed in Example 1.2 that the n-sphere 
S" C R"*' is a topological n-manifold. We put a smooth structure on S” 
as follows. For each i = 1,...,n +1, let (U;“,y;) denote the coordinate 
chart we constructed in Example 1.2. For any distinct indices 2 and 7, the 


transition map Fo (y;-)~* is easily computed. In the case i < j, we get 


~5 °(Y; ) Pe cot ae (Hagia kV Tar cast"), 


and a similar formula holds when i > 7. When 7 = 7, an even simpler com- 
putation gives y; 0 (y;) = Idgn. Thus the collection of charts {(U;-, y;-)} 
is a smooth atlas, and so defines a smooth structure on S”. We call this 
its standard smooth structure. The coordinates defined above will be called 
graph coordinates, because they arise from considering the sphere locally 
as the graph of the function ut = +,/1 — |ul?. 
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Exercise 1.7. By identifying R? with C in the usual way, we can think 
of the unit circle S! as a subset of the complex plane. An angle function on 
a subset U C S! is a continuous function 6: U > R such that e?” = p 
for all p € U. Show that there exists an angle function 0 on an open subset 
U CS! if and only if U 4S. For any such angle function, show that (U,@) 
is a smooth coordinate chart for S' with its standard smooth structure. 


Example 1.12 (Projective spaces). The n-dimensional real projective 
space P” is a topological n-manifold by Example 1.3. We will show that 
the coordinate charts (U;, y;) constructed in that example are all smoothly 
compatible. Assuming for convenience that i > j, it is straightforward to 
compute that 


—1y/,1 n 

Pj °OY; fur preng) 
(= ysl ysrt yt 1 yitl “) 
; ’ 


eae J ? PPS? oan} 


us 


2 EE SS 


us?” ui 7 uy us? us 


which is a diffeomorphism from y;(U; N U;) to y;(Ui NU;). 


Example 1.13 (Product Manifolds). Suppose Mj,...,M, are 
smooth manifolds of dimensions nj,...,n, respectively. The product 
space M, x --- x M, is Hausdorff by Exercise 1.2 and second countable 
by Exercise 1.3. Given a smooth chart (U;,y;) for each M;, the map 
1 Xs X YR: Uy xX ++ X Un @ R™ FF" is a homeomorphism onto its 
image, which is an open subset of R™+"+"*, Thus the product set is a 
topological manifold of dimension nj +---+ nx, with charts of the form 
(Uy x---x Uk, pi X-++X YR). Any two such charts are smoothly compatible 
because, as is easily verified, 


(tbr x +++ x De) 0 (Yr X +++ X GR)" = (W109 YT") X +++ K (We o YR"), 


which is a smooth map. This defines a natural smooth manifold structure 
on the product, called the product smooth manifold structure. For example, 
this yields a smooth manifold structure on the n-dimensional torus T” = 
Stx..-x Sl. 


In each of the examples we have seen so far, we have constructed a smooth 
manifold structure in two stages: We started with a topological space and 
checked that it was a topological manifold, and then we specified a smooth 
structure. It is often more convenient to combine these two steps into a 
single construction, especially if we start with a set or a topological space 
that is not known a priori to be a topological manifold. The following 
lemma provides a shortcut. 


Lemma 1.14 (One-Step Smooth Manifold Structure). Let M be a 
set, and suppose we are given a collection {Uq} of subsets of M, together 
with an injective map ~a: Ua — R” for each a, such that the following 
properties are satisfied. 
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(i) For each a, Ux = Ga(Ua) is an open subset of R”. 
(ti) For each a and B, ~Ya(UaN Ug) and yg(UaNUsg) are open in R”. 


(ii) Whenever Un NUg # 2, pp yet: Ya(Ua NU) > Yg(Ua NUz) is 
smooth. 


(iv) Countably many of the sets U, cover M. 


(v) Whenever p,q are distinct points in M, either there exists some Ug 
containing both p and q or there exist disjoint sets U.,Ug with p € Ua 
and q € Ug. 


Then M has a unique smooth manifold structure such that each (Ua, pa) 
is a smooth chart. 


Proof. We define the topology by taking the sets of the form y.'(V), where 
V C Ug is open, as a basis. To prove that this is a basis for a topology, let 
ya '(V) and v3 (W) be two such basis sets. Properties (ii) and (iii) imply 
that Ya © v3 (W) is an open subset of ~a(Ua Ug), and therefore also of 
U.. Thus if p is any point in yy!(V) M v3 (W), then 


va (VN Ga 0 vg (W)) = va'(V)N vB" (W) 


is a basis open set containing p. Each of the maps Yq is then a homeomor- 
phism (essentially by definition), so M is locally Euclidean of dimension n. 
If {U,,} is a countable collection of the sets U, covering M, each of the 
sets Uy, has a countable basis, and the union of all these is a countable 
basis for M, so M is second countable, and the Hausdorff property follows 
easily from (v). Finally, (iii) guarantees that the collection {(Ua, Ya)} is a 
smooth atlas. It is clear that this topology and smooth structure are the 
unique ones satisfying the conclusions of the lemma. 


Example 1.15 (Grassmann Manifolds). Let V be an n-dimensional 
real vector space. For any integer 0 < k < n, we let G,(V) denote the set 
of all k-dimensional linear subspaces of V. We will show that G;(V) can be 
naturally given the structure of a smooth manifold of dimension k(n — k). 
The construction is somewhat more involved than the ones we have done 
so far, but the basic idea is just to use linear algebra to construct charts for 
G,(V) and then use Lemma 1.14 to show that these charts yield a smooth 
manifold structure. Since we will give a more straightforward proof that 
G,(V) is a smooth manifold after we have developed more machinery in 
Chapter 7, you may skip the details of this construction on first reading if 
you wish. 

Let P and Q be any complementary subspaces of V of dimensions & and 
(n—k), respectively, so that V decomposes as a direct sum: V = P®Q. The 


16 1. Smooth Manifolds 


graph of any linear map A: P > Q is a k-dimensional subspace (A) C V, 
defined by 


T(A) = {a+ Ax: ae P}. 


Any such subspace has the property that its intersection with Q is the zero 
subspace. Conversely, any subspace with this property is easily seen to be 
the graph of a unique linear map A: P > Q. 

Let L(P,Q) denote the vector space of linear maps from P to Q, and 
let Ug denote the subset of G,(V) consisting of k-dimensional subspaces 
whose intersection with Q is trivial. Define a map w: L(P,Q) — Ug by 


p(A) =T(A). 


The discussion above shows that w is a bijection. Let p = ~7': Ug - 
L(P,Q). By choosing bases for P and Q, we can identify L(P,Q) with 
M((n—k) xk, R) and hence with R*("—*), and thus we can think of (Ug, ¢) 
as a coordinate chart. Since the image of each chart is all of L(P, Q), con- 
dition (i) of Lemma 1.14 is clearly satisfied. 

Now let (P’,Q’) be any other such pair of subspaces, and let 7’, y’ be 
the corresponding maps. The set y(Ug N Ug) C L(P,Q) consists of all 
A € L(P,Q) whose graphs intersect both Q and Q’ trivially, which is easily 
seen to be an open set, so (ii) holds. We need to show that the transition 
map y’ oy! = y’ oy is smooth on this set. This is the trickiest part of 
the argument. 

Suppose A € y(Ug N Ug’) C L(P,Q) is arbitrary, and let S denote the 
subspace ~(A) =T(A) C V. If we put A’ = vy’ ow(A), then A’ is the unique 
linear map from P’ to Q’ whose graph is equal to S. To identify this map, 
let x’ € P’ be arbitrary, and note that A’x’ is the unique element of Q’ 
such that x’ + A’az’ € S, which is to say that 


z+ A'c’ =x+ Ax for some xe P. (1.1) 


(See Figure 1.3.) There is in fact a unique x € P for which this holds, 
characterized by the property that 


x+Ar—x2' €Q’. 
If we let [4: P — V denote the map [4(x) = «+ Az and let mp: V > P’ 
be the projection onto P’ with kernel Q’, then x satisfies 
0=ap (a+ Av — 2’) = mp ol,a(x) — 2". 


As long as A stays in the open subset of maps whose graphs intersect both 
Q and Q’ trivially, tp: o I4: P — P’ is invertible, and thus we can solve 
this last equation for x to obtain x = (mp o I4)~+(2’). Therefore, A’ is 
given in terms of A by 


Al! = Ign — 2! = 14.0 (apr ota) 12’) — 2’. (1.2) 
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FIGURE 1.3. Smooth compatibility of coordinates on G,(V). 


If we choose bases (7) for P’ and (F;) for Q’, the columns of the matrix 
representation of A’ are the components of A’E/. By (1.2), this can be 
written 


A'E! =I, 0 (mp oI4)71(E)) — El. 


The matrix entries of [4 clearly depend smoothly on those of A, and thus so 
also do those of mp/oI,. By Cramer’s rule, the components of the inverse of 
a matrix are rational functions of the matrix entries, so the expression above 
shows that the components of A’E! depend smoothly on the components 
of A. This proves that y’ o y~! is a smooth map, so the charts we have 
constructed satisfy condition (iii) of Lemma 1.14. 

To check the countability condition (iv), we just note that G;(V) can in 
fact be covered by finitely many of the sets Ug: For example, if (£1,..., En) 
is any fixed basis for V, any partition of the basis elements into two subsets 
containing k and n — k elements determines appropriate subspaces P and 
Q@, and any subspace S$ must have trivial intersection with Q for at least 
one of these partitions (see Exercise A.4). Thus G;(V) is covered by the 
finitely many charts determined by all possible partitions of a fixed basis. 
Finally, the Hausdorff condition (v) is easily verified by noting that for any 
two k-dimensional subspaces P, P’ C V, it is possible to find a subspace Q 
of dimension n — k whose intersections with both P and P’ are trivial, and 
then P and P’ are both contained in the domain of the chart determined 
by, say, (P,Q). 

The smooth manifold G;(V) is called the Grassmann manifold of k- 
planes in V, or simply a Grassmannian. In the special case V = R”, the 
Grassmannian G;(R”) is often denoted by some simpler notation such as 
Grn or G(k, n). Note that G;(R”**) is exactly the n-dimensional projective 
space P”. 
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FIGURE 1.4. A coordinate grid. 


Exercise 1.8. Let 0 < k < n be integers, and let P,Q C R” be the 
subspaces spanned by (e1,...,ex) and (€x41,.--,€n), respectively, where e; 
is the ith standard basis vector. For any k-dimensional subspace $ C R” 
that has trivial intersection with Q, show that the coordinate representation 
(S) constructed in the preceding example is the unique (n—k) x k matrix B 
such that S' is spanned by the columns of the matrix CA, where J; denotes 
the k x k identity matrix. 


Local Coordinate Representations 


Here is how one usually thinks about local coordinate charts on a smooth 
manifold. Once we choose a chart (U, y) on M, the coordinate map y: U > 
U CR” can be thought of as giving an identification between U and us: 
Using this identification, we can think of U simultaneously as an open 
subset of M and (at least temporarily while we work with this chart) as 
an open subset of R”. You can visualize this identification by thinking of a 
“erid” drawn on U representing the inverse images of the coordinate lines 
under y (Figure 1.4). Under this identification, we can represent a point 
p€ M by its coordinates (x',...,2”) = y(p), and think of this n-tuple as 
being the point p. We will typically express this by saying “(x',...,a”) is 
the (local) coordinate representation for p” or “p = (at,...,2”) in local 
coordinates.” 
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FIGURE 1.5. A manifold with boundary. 


Another way to look at it is that by means of our identification U @ U, 
we can think of y as the identity map and suppress it from the notation. 
This takes a bit of getting used to, but the payoff is a huge simplification 
of the notation in many situations. You just need to remember that the 
identification depends heavily on the choice of coordinate chart. 

For example, if M = R?, let U = {(x,y) : x > 0} be the open right 
half-plane, and let y: U — R? be the polar coordinate map (x,y) = 
(r,0) = (\/x? +y?, arctan y/x). We can write a given point p € U ei- 
ther as p = (a,y) in standard coordinates or as p = (r,@) in po- 
lar coordinates, where the two coordinate representations are related by 
(r,0) = (\/a? + y?, arctan y/x) and (a,y) = (rcos6,rsin 6). 


Manifolds With Boundary 


For some purposes, we will need the following generalization of manifolds. 
An n-dimensional topological manifold with boundary is a second count- 
able Hausdorff space in which every point has a neighborhood homeo- 
morphic to an open subset of the closed n-dimensional upper half space 
H” = {(a,...,2") € R”: 2” > 0} (Figure 1.5). An open subset U Cc M 
together with a homeomorphism y from U to an open subset of H” is called 
a generalized chart for M. 

The boundary of H” in R” is the set of points where x” = 0. If M isa 
manifold with boundary, a point that is in the inverse image of OH” under 
some generalized chart is called a boundary point of M, and a point that is 
in the inverse image of Int H” is called an interior point. The boundary of 
M (the set of all its boundary points) is denoted 0M; similarly its interior 
is denoted Int M. 
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Be careful to observe the distinction between this use of the terms 
“boundary” and “interior” and their usage to refer to the boundary and 
interior of a subset of a topological space. A manifold M with boundary 
may have nonempty boundary in this new sense, irrespective of whether it 
has a boundary as a subset of some other topological space. If we need to 
emphasize the difference between the two notions of boundary, we will use 
the terms topological boundary or manifold boundary as appropriate. 

To see how to define a smooth structure on a manifold with boundary, 
recall that a smooth map from an arbitrary subset A C R” is defined 
to be one that extends smoothly to an open neighborhood of A (see the 
Appendix). Thus if U is an open subset of H”, a smooth map F:U—R* 
is a map that extends to a smooth map F:U3 R*, where U is some open 
subset of R” containing U. If F is such a map, by continuity all the partial 
derivatives of F’ at points of OH” are determined by their values in H”, 
and therefore in particular are independent of the choice of extension. It is 
a fact (which we will neither prove nor use) that F: U — R* has such a 
smooth extension if and only if F' is continuous, F'|yqmtH» is smooth, and 
each of the partial derivatives of F|uqmtn has a continuous extension to 
UNH”. 

For example, let B? C R? denote the unit disk, let U = B? 9 H?, and 
denne f:U >R by f(a,y) = /1— 2? — y?. Because f extends to all of 
? (by the same formas, f is asmooth function on U. On the other hand, 
although g(x,y) = \/y is continuous on U and smooth in UN Int H?, it has 
no smooth extension to any neighborhood of U in R? because 0g/Oy — co 
as y > 0. Thus g is not a smooth function on U. 

Given a topological manifold with boundary M, we define an atlas for 
M as before to be a collection of generalized charts whose domains cover 
M. Two such charts (U,y), (V,~) are smoothly compatible if wo y7! is 
smooth (in the sense just described) wherever it is defined. Just as in the 
case of manifolds, a smooth atlas for M is an atlas all of whose charts are 
smoothly compatible with each other, and a smooth structure for M is a 
maximal smooth atlas. 

It can be shown using homology theory that the interior and boundary 
of a topological manifold with boundary are disjoint (see [Lee00, Problem 
13-9], for example). We will not need this result, because the analogous 
result for smooth manifolds with boundary is much easier to prove (or will 
be, after we have developed a bit more machinery). A proof is outlined in 
Problem 5-19. 

Since any open ball in R” admits a diffeomorphism onto an open sub- 
set of H”, a smooth n-manifold is automatically a smooth n-manifold with 
boundary (whose boundary is empty), but the converse is not true: A man- 
ifold with boundary is a manifold if and only if its boundary is empty. (This 
will follow from the fact that interior points and boundary points are dis- 
tinct.) 
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Problems 


1-1. 


1-2. 


1-3. 


1-4. 


1-5. 


Let X be the set of all points (x,y) € R? such that y = +1, and 
let M be the quotient of X by the equivalence relation generated by 
(a,—1) ~ (2,1) for all « 4 0. Show that M is locally Euclidean and 
second countable, but not Hausdorff. [This space is called the line 
with two origins.] 


Show that the disjoint union of uncountably many copies of R is 
locally Euclidean and Hausdorff, but not second countable. 


Let N = (0,...,0,1) be the “north pole” and S = —N the “south 
pole.” Define stereographic projection 0: S" \ {N} — R” by 


1 n 
1 +1 Cae 
lB s e230? ) ene 


Let a(x) = o(—2) for x € S”\ {S}. 
(a) Show that a is bijective, and 


(Qut,...,2u™, |u|? — 1) 


Cae ee a ul? +1 


(b) Compute the transition map Go o~! and verify that the atlas 
consisting of the two charts (S" \ {N},o) and (S” \ {S},a) 
defines a smooth structure on S”. (The coordinates defined by 
o or @ are called stereographic coordinates.) 


(c) Show that this smooth structure is the same as the one defined 
in Example 1.11. 


Let M be a smooth n-manifold with boundary. Show that Int M is 
a smooth n-manifold and 0M is a smooth (n — 1)-manifold (both 
without boundary). 


Let M = B”, the closed unit ball in R”. Show that M is a mani- 
fold with boundary and has a natural smooth structure such that its 
interior is the open unit ball with its standard smooth structure. 
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2 
Smooth Maps 


The main reason for introducing smooth structures was to enable us to 
define smooth functions on manifolds and smooth maps between manifolds. 
In this chapter, we will carry out that project. 

Although the terms “function” and “map” are technically synonymous, 
when studying smooth manifolds it is often convenient to make a slight dis- 
tinction between them. Throughout this book, we will generally reserve the 
term “function” for a map whose range is R (a real-valued function) or R* 
for some k > 1 (a vector-valued function). The word “map” or “mapping” 
can mean any type of map, such as a map between arbitrary manifolds. 

We begin by defining smooth real-valued and vector-valued functions, 
and then generalize this to smooth maps between manifolds. We then study 
diffeomorphisms, which are bijective smooth maps with smooth inverses. If 
there is a diffeomorphism between two manifolds, we say they are diffeomor- 
phic. The main objects of study in smooth manifold theory are properties 
that are invariant under diffeomorphisms. 

Later in the chapter, we study smooth covering maps, and their relation- 
ship to the continuous covering maps studied in topology; and we introduce 
Lie groups, which are smooth manifold that are also groups in which mul- 
tiplication and inversion are smooth maps. 

At the end of the chapter, we introduce some powerful tools for smoothly 
piecing together local smooth objects, called bump functions and partitions 
of unity. They will be used throughout the book for building global smooth 
objects out of ones that are initially defined only locally. 
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Smooth Functions and Smooth Maps 


If M is a smooth manifold, a function f: M — R* is said to be smooth 
if, for every smooth chart (U,y) on M, the composite function f o p~ 
is smooth on the open subset y(U) C R”. The most important special 
case is that of smooth real-valued functions f: M — R; the set of all such 
functions is denoted by C®(M). Because sums and constant multiplies of 
smooth functions are smooth, C~(M) is a vector space. In fact, it is a ring 
under pointwise multiplication, as you can easily verify. 

Although by definition smoothness of f means that its composition with 
every smooth coordinate map is smooth, in practice it suffices to check 
smoothness in each of the charts of some smooth atlas, as the next lemma 
shows. 


Lemma 2.1. Suppose {(Ua,%a)} is a smooth atlas for M. If f: M — R* 
is a function such that f oy, is smooth for each a, then f is smooth. 


Proof. We just need to check that fo y~! is smooth for any smooth chart 
(U,y) on M. It suffices to show it is smooth in a neighborhood of each 
point « = y(p) € y(U). For any p € U, there is a chart (Ua, Yq) in the 
atlas whose domain contains p. Since (U,y) is smoothly compatible with 
(Ua; Ya), the transition map yoy! is smooth on its domain of definition, 
which includes x. Thus f 0 y~! = (f 0 yy!) 0 (va 0 y+) is smooth in a 
neighborhood of x. 


Given a function f: M — R* and a chart (U,y) for M, the function 
f: y(U) — R* defined by f(x) = fog 1(za) is called the coordinate rep- 
resentation of f. For example, consider f(x,y) = x? + y? on the plane. 
In polar coordinates, it has the coordinate representation flr, 6) = r?7. In 
keeping with our practice of using local coordinates to identify U with a 
subset of Euclidean space, in cases where it will cause no confusion we will 
often not even observe the distinction between f and f itself, and write 
f(r,9) = r? in polar coordinates. Thus, we might say “f is smooth on U 
because its coordinate representation f(r,@) =r? is smooth.” 

The definition of smooth functions generalizes easily to maps between 
manifolds. Let M7, N be smooth manifolds, and let F: M — N be any 
map. We say F is a smooth map if, for any smooth charts (U, y) for M and 
(V,w) for N, the composite map yo Foy! is smooth from p(UNF~1(V)) 
to w(V). Note that our previous definition of smoothness of real-valued 
functions can be viewed as a special case of this one, by taking N = R* 
and w = Id. 


Exercise 2.1 (Smoothness is Local). Let F: M — N be a map be- 
tween smooth manifolds, and suppose each point p € M has a neighborhood 
U such that F'|y is smooth. Show that F is smooth. 
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We call F = woFoy! the coordinate representation of F with respect to 
the given coordinates. As with real-valued or vector-valued functions, once 
we have chosen specific local coordinates in both the domain and range, we 
can often ignore the distinction between F’ and F. 

Just as for functions, to prove that a map is smooth it suffices to show 
that its coordinate representatives with respect to a particular smooth atlas 
are smooth. The proof is analogous to that of Lemma 2.1 and is left as an 
exercise. 


Lemma 2.2. Let M, N be smooth manifolds and let F: M — N be any 
map. If {(Ua;Pa)} and {(Vg,vea)} are smooth atlases for M and N, re- 
spectively, and if for each a and B, Wgo Foy! is smooth on its domain 
of definition, then F is smooth. 


Exercise 2.2. Prove Lemma 2.2. 


Lemma 2.3. Any composition of smooth maps between manifolds is 
smooth. 


Proof. Given smooth maps F': M — N and G: N — P, let (U,y) and 
(V,w) be any charts for M and P respectively. We need to show that 
wo(GoF)oy~t is smooth where it is defined, namely on y(UN(GoF’)~1(V)). 
For any point p € UN(Go F)~1(V), there is a chart (W,@) for N such that 
F(p) € W. Smoothness of F and G means that 00 Foy! and woGo@"! 
are smooth where they are defined, and therefore 7 o (Go F)o pt = 
(boGo6')o0(@0F ow?) is smooth. 


Example 2.4 (Smooth maps). 


(a) Consider the n-sphere S” with its standard smooth structure. The 
inclusion map v: S” — R”t! is certainly continuous, because it is 
the inclusion map of a topological subspace. It is a smooth map be- 
cause its coordinate representation with respect to any of the graph 
coordinates of Example 1.11 is 


OG cee Oh SEO) (Ucn sae) 
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which is smooth on its domain (the set where |u|? <1). 


(b) The quotient map 7: R"*! \ {0} — P” is smooth, because its coor- 
dinate representation in terms of the coordinates for P” constructed 
in Example 1.12 and standard coordinates on R"*! \ {0} is 


a Gone ane) = Coe oto 
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er 
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(c) Define p: S" — P” as the restriction of 7: R"t+ \ {0} — P” to 
S” c R™*1 \ {0}. It is a smooth map, because it is the composition 
p= oc. of the maps in the preceding two examples. 


Exercise 2.3. Let M,...,M, and N be smooth manifolds. Show that a 
map F': N > M, x---x Mx is smooth if and only if each of the “component 
maps” F; = 7; 0 F: N — M; is smooth. (Here m: Mi x--- x Mp — M; is 
the projection onto the ith factor.) 


The definitions of smooth functions and smooth maps on a manifold 
with boundary are exactly the same as for manifolds; you can work out the 
details for yourself. 


Diffeomorphisms 


A diffeomorphism between manifolds M and N is a smooth map F': M > 
N that has a smooth inverse. We say M and N are diffeomorphic if there 
exists a diffeomorphism between them. Sometimes this is symbolized by 
M = N. For example, if B” denotes the open unit ball in R”, the map 
F: B” > R” given by F(x) = x/(1 —|2|?) is easily seen to be a diffeomor- 
phism, so B” = R”. 


Exercise 2.4. Show that “diffeomorphic” is an equivalence relation. 


More generally, F: M — N is called a local diffeomorphism if every 
point p € M has a neighborhood U such that F(U) is open in N and 
F:U — F(U) is a diffeomorphism. It is clear from the definition that a 
local diffeomorphism is, in particular, a local homeomorphism and therefore 
an open map. 


Exercise 2.5. Show that a map F': M — N is a diffeomorphism if and 
only if it is a bijective local diffeomorphism. 


Just as two topological spaces are considered to be “the same” if they 
are homeomorphic, two smooth manifolds are essentially indistinguishable 
if they are diffeomorphic. The central concern of smooth manifold theory 
is the study of properties of smooth manifolds that are preserved by dif- 
feomorphisms. 

One question that naturally arises is to what extent a smooth structure 
on a given topological manifold might be unique. There are really two 
different questions here: The first is whether a given manifold M7 admits 
distinct smooth structures, and the second is whether it admits smooth 
structures that are not diffeomorphic to each other. 

Let us begin by addressing the first question. It is easy to see that two 
smooth structures A;, Az on a given manifold M are the same if and only 
if the identity map of M is a diffeomorphism from (M, Aj) to (M, Az). 


Smooth Functions and Smooth Maps 27 


In general, a given topological manifold will admit very many distinct 
smooth structures. For example, consider the two homeomorphisms y: R = 
R and =: R — R given by 


Each of the atlases {(R,y)} and {(R,~)} determines a smooth structure 
on R. (Since there is only one chart in each case, the smooth compati- 
bility condition is trivially satisfied.) These two charts are not smoothly 
compatible with each other, because yo w!(y) = y'/3 is not smooth at 
the origin. Therefore the two smooth structures on R determined by these 
atlases are distinct. Using similar ideas, it is not hard to construct many 
different smooth structures on any given manifold. 

The second question, whether two given smooth structures are diffeomor- 
phic to each other, is more subtle. Consider the same two smooth structures 
on R, and for the moment let R, denote R with the smooth structure deter- 
mined by ¢ (this is just the standard smooth structure) and Ry, the same 
topological manifold but with the smooth structure determined by w. It 
turns out that these two manifolds are diffeomorphic to each other. Define 
amap F: R, — Ry by F(x) = «!/%. The coordinate representation of this 
map is F(t) = WoFog l(t) =t, which is clearly smooth. Moreover, the 
coordinate representation of its inverse is F-1(y) =ypoF lowly) =y, 
which is also smooth, so F' is a diffeomorphism. (This is one case in which it 
is important to maintain the distinction between a map and its coordinate 
representation! ) 

It turns out, as you will see later, that there is only one smooth struc- 
ture on R up to diffeomorphism (see Problem 12-5). More precisely, if A, 
and A» are any two smooth structures on R, there exists a diffeomorphism 
F: (R, Ai) — (R, Az). In fact, it follows from work of Edwin Moise [Moi77] 
and James Munkres [Mun60] that every topological manifold of dimension 
less than or equal to 3 has a smooth structure that is unique up to dif- 
feomorphism. The analogous question in higher dimensions turns out to 
be quite deep, and is still largely unanswered. Even for Euclidean spaces, 
the problem was not completely solved until late in the twentieth century. 
The answer is somewhat surprising: As long as n # 4, R” has a unique 
smooth structure (up to diffeomorphism); but R*+ has uncountably many 
distinct smooth structures, no two of which are diffeomorphic to each other! 
The existence of nonstandard smooth structures on R* (called fake R‘s) 
was first proved by Simon Donaldson and Michael Freedman in 1984 as 
a consequence of their work on the geometry and topology of compact 
4-manifolds; the results are described in [DK90] and [FQ90]. 

For compact manifolds, the situation is even more interesting. For ex- 
ample, in 1963, Michel Kervaire and John Milnor [KM63] showed that, up 
to diffeomorphism, S” has exactly 28 non-diffeomorphic smooth structures. 
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On the other hand, in all dimensions greater than 3 there are compact 
topological manifolds that have no smooth structures at all. (The first ex- 
ample was found in 1960 by Kervaire [Ker60].) The problem of identifying 
the number of smooth structures (if any) on topological 4-manifolds is an 
active subject of current research. 


Smooth Covering Maps 


You are probably already familiar with the notion of a covering map be- 
tween topological spaces: This is a surjective continuous map 7: M — M 
between connected, locally path connected spaces, with the property that 
every point p € M has a neighborhood U that is evenly covered, meaning 
that each component of 7~1(U) is mapped homeomorphically onto U by 
am. In this section, we will assume familiarity with the basic properties of 
covering maps, as described for example in [Lee00, Chapters 11 and 12]. 

In the context of smooth manifolds, it is useful to introduce a slightly 
more restrictive type of covering map. If M and M are connected smooth 
manifolds, a smooth covering map 7: M — M is a smooth surjective map 
with the property that every p € M has a neighborhood U such that each 
component of 7~!(U) is mapped diffeomorphically onto U by 7. In this 
context, we will also say that U is evenly covered. The manifold M is 
called the base of the covering, and M is called a covering space of M. 

To distinguish this new definition from the previous one, we will often call 
an ordinary (not necessarily smooth) covering map a topological covering 
map. A smooth covering map is, in particular, a topological covering map. 
However, it is important to bear in mind that a smooth covering map is 
more than just a topological covering map that happens to be smooth—the 
definition of smooth covering map requires in addition that the restriction 
of 7 to each component of the inverse image of an evenly covered set be a 
diffeomorphism, not just a smooth homeomorphism. 


Proposition 2.5 (Properties of Smooth Coverings). 
(a) Any smooth covering map is a local diffeomorphism and an open map. 
(b) An injective smooth covering map is a diffeomorphism. 


(c) A topological covering map is a smooth covering map if and only if it 
is a local diffeomorphism. 


Exercise 2.6. Prove Proposition 2.5. 


If 7: M _— Mis any continuous map, a section of 7 is a continuous map 
a: M —_M such that too = Idy. A local section is a continuous map 
a: U > M defined on some open set U C M and satisfying the analogous 
relation 700 = Idy. Many of the important properties of smooth covering 
maps arise from the existence of smooth local sections. 
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Lemma 2.6 (Local Sections of Smooth Coverings). Suppose 
a: M > M is a smooth covering map. Every point of M is in the image 
of a smooth local section of 7. More precisely, for any q € M, there is a 
neighborhood U of p = 7(q) and a smooth local section 0: U + M such 
that o(p) = q. 


Proof. Let U Cc M be an evenly covered neighborhood of p. If U is the 
component of 7~'(U) containing q, then m|™: U — U is by hypothesis a 
diffeomorphism. It follows that o = (|g%)~': U > U is a smooth local 
section of 7 such that o(p) = q. 


One important application of local sections is the following proposition, 
which gives a very simple criterion for deciding which maps out of the base 
space of a covering are smooth. 


Proposition 2.7. Suppose 7: M = M is a smooth covering map and 
N is any smooth manifold. A map F: M — N is smooth if and only if 
For: M = M is smooth: 


M 
Tv For 
M N. 


Proof. One direction is obvious by composition. Suppose conversely that 
For is smooth, and let p € M be arbitrary. By the preceding lemma, there 
is a neighborhood U of p and a smooth local section 0: U — M, so that 
noo =Idy. Then the restriction of F to U satisfies 


Fly =Foldy =Fo(noa)=(Fon)oo, 


which is a composition of smooth maps. Thus F’ is smooth on U. Since F 
is smooth in a neighborhood of each point, it is smooth. 


The next proposition shows that every covering space of a smooth man- 
ifold is itself a smooth manifold. 


Proposition 2.8. If M is a smooth manifold and m: M = M is any 
topological covering map, then M has a unique smooth manifold structure 
such that 7 is a smooth covering map. 


Proof. Because 7 is, in particular, a local homeomorphism, it is clear that 
M is locally Euclidean. _ 

Let p,q be distinct points in M. If z(p) = z(q) and U C M is an 
evenly covered open set containing 7(p), then the components of 7~1(U) 


containing p and q are disjoint open subsets of M separating p and gq. 


30 2. Smooth Maps 


On the other hand, if m(p) 4 z(q), there are disjoint open sets U and V 
containing 7(p) and (gq), respectively, and then t~1(U) and 7~1(V) are 
open subsets of M separating p and q. Thus M is Hausdorff. 

The fibers of 7 are countable, because the fundamental group of M is 
countable and acts transitively on each fiber [Lee00, Theorems 8.11 and 
11.21]. Thus if {U;}ien is a countable basis for the topology of M , it is easy 
to check that the set of all components 0 of 7~ 1WUi ) as 7 ranges over N forms 
a countable basis for the topology of M, so M is second countable. 

Any point p € M has an evenly Govered neighborhood U. Shrinking U if 
peut we may assume also that it is the domain of a coordinate map 

: U + R”. Letting U bea component of 7 1(U) and @= pom: UR", 
4 is clear that (U, @) is a chart on M. If two such charts (U,S) and (V, v) 
overlap, the transition map can be written 
poP* =(pomlgar)o(votleagy) 
=ywo Tle ae ° (maar) oy 
=pog™, 
which is smooth. Thus the collection of all such charts defines a smooth 


structure on M. The uniqueness of this smooth structure is left as an 
exercise. 


-1 


Exercise 2.7. Prove that the smooth structure constructed above on M 
is the unique one such that 7 is a smooth covering map. [Hint: Use the 
existence of smooth local sections.] 


Lie Groups 


A Lie group is a smooth manifold G that is also a group in the algebraic 
sense, with the property that the multiplication map m: G x G — G and 
inversion map 7: G — G, given by 


m(g,h)= gh, i(g)=g™", 


are both smooth. Because smooth maps are continuous, a Lie group is, in 
particular, a topological group (a topological space with a group structure 
such that the multiplication and inversion maps are continuous). 

The group operation in an arbitrary Lie group will be denoted by juxta- 
position, except in certain abelian groups such as R” in which the operation 
is usually written additively. It is traditional to denote the identity element 
of an arbitrary Lie group by the symbol e (for German Einselement, “unit 
element”), and we will follow this convention, except in specific examples 
in which there are more common notations (such as J or I, for the identity 
matrix in a matrix group, or 0 for the identity element in R”). 
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The following alternative characterization of the smoothness condition 
is sometimes useful. 


Lemma 2.9. Suppose G is a smooth manifold with a group structure such 
that the map G x G => G given by (g,h) + gh7! is smooth. Then G is a 
Lie group. 


Exercise 2.8. Prove Lemma 2.9. 


Example 2.10 (Lie Groups). Each of the following manifolds is a Lie 
group with the indicated group operation. 


(a) 


The general linear group GL(n, R) is the set of invertible n x n matri- 
ces with real entries. It is a group under matrix multiplication, and 
it is an open submanifold of the vector space M(n, R) as we observed 
in Chapter 1. Multiplication is smooth because the matrix entries of 
a product matrix AB are polynomials in the entries of A and B. In- 
version is smooth because Cramer’s rule expresses the entries of A! 
as rational functions of the entries of A. 


The complex general linear group GL(n,C) is the group of complex 
nm Xn matrices under matrix multiplication. It is an open submanifold 
of M(n,C) and thus a 2n?-dimensional smooth manifold, and it is a 
Lie group because matrix products and inverses are smooth functions 
of the real and imaginary parts of the matrix entries. 


If V is any real or complex vector space, we let GL(V) denote the 
group of invertible linear transformations from V to itself. If V is 
finite-dimensional, any basis for V determines an isomorphism of 
GL(V) with GL(n,R) or GL(n,C), with n = dimV, so GL(V) is 
a Lie group. 


The real number field R and Euclidean space R” are Lie groups un- 
der addition, because the coordinates of x — y are smooth (linear!) 
functions of (a, y). 


The set R* of nonzero complex numbers is a 1-dimensional Lie group 
under multiplication. (In fact, it is exactly GL(1,R), if we identify 
a 1x 1 matrix with the corresponding real number.) The subset R* 
of positive real numbers is an open subgroup, and is thus itself a 
1-dimensional Lie group. 


The set C* of nonzero complex numbers is a 2-dimensional Lie group 
under complex multiplication, which can be identified with GL(1, C). 


The circle S' C C* is a smooth manifold and a group under complex 
multiplication. Using appropriate angle functions as local coordinates 
on open subsets of S', multiplication and inversion have the smooth 
coordinate expressions (01,02) +> 6; + @2 and 6++ —86, and therefore 
S! is a Lie group, called the circle group. 


62 3. The Tangent Bundle 
Proof. Let (a) be coordinates on a neighborhood U of p, and let X‘0/0z"|, 


be the coordinate expression for X. If y is a bump function supported in 
U and with y(p) = 1, the vector field X defined by 


qe U, 


0 q¢ U, 


is easily seen to be a smooth vector field whose value at p is equal to X. 


We will use the notation J(M/) to denote the set of all smooth vector 
fields on M. It is clearly a real vector space under pointwise addition and 
scalar multiplication. Moreover, vector fields can be multiplied by smooth 
functions: If f € C°(M) and Y € J(M), we obtain a new vector field fY 
by 


(FY )p = f(p)¥p. 


(Many authors use the notation X(M) instead of J(M). However, J(M) 
is more amenable to generalization—as a rule, we will use the script letter 
corresponding to the name of a bundle to denote the its space of smooth 
sections. ) 


Exercise 3.7. If f € C°(M) and Y € J(M), show that fY is a smooth 
vector field. 


Exercise 3.8. Show that T(J) is a module over the ring C*°(M). 


If M is a smooth manifold, we will use the term local frame for M to 
mean a local frame for TM over some open subset U C M. Similarly, a 
global frame for M is a global frame for TM. We say M is parallelizable if 
it admits a smooth global frame, which is equivalent to TM being a trivial 
bundle (see Problem 3-5). 

A vector field on a manifold with boundary is defined in exactly the same 
way as on a manifold. All of the results of this section hold equally well in 
that case. 


Push-forwards of Vector Fields 


If F: M — N is a smooth map and Y is a smooth vector field on M, 
then for each point p € M, we obtain a vector FY, € Trip) N by pushing 
forward Y,. However, this does not in general define a vector field on N. 
For example, if F’ is not surjective, there is no way to decide what vector to 
assign to a point q€ N\ F(M). If F is not injective, then for some points 
of N there may be several different vectors obtained as push-forwards of Y 
from different points of M. 
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If F: M > N is smooth and Y € J(M), suppose there happens to be a 
vector field Z € J(.N) with the property that for each p €¢ M, FxY, = Zp). 
In this case, we say the vector fields Y and Z are F’-related. 

Here is a useful criterion for checking that two vector fields are F-related. 


Lemma 3.17. Suppose F: M > N is a smooth map, X € T(M), and 
Y € T(N). Then X and Y are F-related if and only if for every smooth 
function f defined on an open subset of N, 


X(foF)=(Xf)oF. (3.7) 
Proof. For any pe M, 


X (fo F)(p) = X,(fo F) 
= (FXp)f, 
while 
(Xf) o F(p) = (Xf)(F(P)) 
= Xp f. 


Thus (3.7) is true for all f if and only if F,.X, = XF,p) for all p, ie., if and 
only if X and X are F-related. 


It is important to remember that for a given vector field Y and map F, 
there may not be any vector field on N that is F-related to Y. There is 
one special case, however, in which there is always such a vector field, as 
the next proposition shows. 


Proposition 3.18. Suppose F: M — N is a diffeomorphism. For every 
smooth vector field Y € J(M), there is a unique smooth vector field on N 
that is F'-related to Y . 


Proof. For Z € J(N) to be F-related to Y means that FY, = Zp») for 
every p € M. If F is a diffeomorphism, therefore, we define Z by 


Zq = F(Yr-1(q))- 


It is clear that Z, so defined, is the unique vector field that is F-related to 
Y, and it is smooth because it is equal to the composition 


Fu PN, 


(See Exercise 3.4.) 


In the situation of the preceding lemma, we will denote the unique vector 
field that is F-related to Y by FY, and call it the push-forward of Y by 
F’. Remember, it is only when F is a diffeomorphism that FY is defined. 
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3. The Tangent Bundle 


Problems 


3-1. 


3-4. 


3-5. 


3-6. 


3-7. 


Suppose M and N are smooth manifolds with M connected, and 
F: M — N is a smooth map such that F.: T,M — Trp)N is the 
zero map for each p € M. Show that F' is a constant map. 


. Let Mj,..., Mz be smooth manifolds, and let 7;: My x --- x Mp > 


M; be the projection onto the jth factor. For any choices of points 
pi€ Mj, i=1,...,k, show that the map 


a: Typ, ,...px)(Ma x +++ X Mz) > Tp, M1 x +++ X Ty, Me 
defined by 
a(X) = (m14X,..., The X) 
is an isomorphism, with inverse 
Or (Xa ape) = Vig ae IeE XE 


where j;: Mj; — M, x --- x Meg is given by ji(q) = 
(p1,-+-;Pi-1;4; Pit1,---,Pk)- [Using this isomorphism, we will rou- 
tinely identify TM, for example, as a subspace of T(p,q)(M x N).|] 


. If a nonempty n-manifold is diffeomorphic to an m-manifold, prove 


that n = m. 


Show that there is a smooth vector field on S? that vanishes at exactly 
one point. [Hint: Try using stereographic projection.| 


Let E be a smooth vector bundle over M. Show that E admits a 
local frame over an open subset U C M if and only if it admits a 
local trivialization over U, and FE admits a global frame if and only 
if it is trivial. 
Show that S!, S?, and T” = S! x --- x S! are all parallelizable. [Hint: 
Consider the vector fields 

O O O O 
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on $3, 
Let M be asmooth manifold and p € M. Show that T,M is naturally 


isomorphic to the space of derivations of C>° (the space of germs of 
smooth functions at p). 


4 
The Cotangent Bundle 


In this chapter, we introduce a construction that is not typically seen in 
elementary calculus: tangent covectors, which are linear functionals on the 
tangent space at a point p © M. The space of all covectors at p is a vector 
space called the cotangent space at p; in linear-algebraic terms, it is the 
dual space to T;,,M. The union of all cotangent spaces at all points of M is 
a vector bundle called the cotangent bundle. 

Whereas tangent vectors give us a coordinate-free interpretation of deriv- 
atives of curves, it turns out that derivatives of real-valued functions on a 
manifold are most naturally interpreted as tangent covectors. Thus we will 
define the differential of a function as a covector field (a smooth section of 
the cotangent bundle); it is a sort of coordinate-invariant analogue of the 
classical gradient. 

At the end of the chapter, we define line integrals of covector fields. This 
allows us to generalize the classical notion of line integrals to manifolds. 
Then we explore the relationships among three closely related types of 
covector fields: exact (those that are the differentials of functions), conser- 
vative (those whose line integrals around closed curves are zero), and closed 
(those that satisfy a certain differential equation in coordinates). 


Covectors 


Let V be a finite-dimensional vector space. (As usual, all of our vector 
spaces are assumed to be real.) We define a covector on V to be a real- 
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valued linear functional on V, that is, a linear map w: V — R. The space of 
all covectors on V is itself a real vector space under the obvious operations 
of pointwise addition and scalar multiplication. It is denoted by V* and 
called the dual space to V. 

The most important fact about V* is expressed in the following propo- 
sition. 
Proposition 4.1. Let V be a_ finite-dimensional vector space. If 
(F,,...,En) is any basis for V, then the covectors (e1,...,€"), defined 
by 


form a basis for V*, called the dual basis to (E;). Therefore dimV* = 
dim V. 


Remark. The symbol ov used in this proposition, meaning 1 if 7 = 7 and 0 
otherwise, is called the Kronecker delta. 


Exercise 4.1. Prove Proposition 4.1. 


For example, if (e;) denotes the standard basis for R”, we denote the dual 
basis by (e',...,e”) (note the upper indices), and call it the standard dual 
basis. These basis covectors are the linear functions from R” to R given by 


ei(v) =e (v',...,u%) =v". 


In other words, e/ is just the linear functional that picks out the jth com- 
ponent of a vector. In matrix notation, a linear map from R” to R is 
represented by a 1 x n matrix, i.e., a row matrix. The basis covectors can 
therefore also be thought of as the linear functionals represented by the 
row matrices 


eS Oinge Oy dia og BS Ores Os 


In general, if (E;) is a basis for V and (e/) is its dual basis, then Propo- 
sition 4.1 shows that we can express an arbitrary covector w € V* in terms 
of the dual basis as 


W = Wie", 
where the components w; are determined by 
w; = w( Ei). 
Then the action of w on a vector X = X'E; is 


w(X) =u, X*. (4.1) 
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We will always write basis covectors with lower indices, and components 
of a covector with upper indices, because this helps to ensure that mathe- 
matically meaningful expressions such as (4.1) will always follow our index 
conventions: Any index that is to be summed over in a given term appears 
twice, once as a subscript and once as a superscript. 

Suppose V and W are vector spaces and A: V — W is a linear map. We 
define a linear map A*: W* — V%, called the dual map or transpose of A, 
by 


(A*w)(X) =w(AX), forweWw*,X EV. 


Exercise 4.2. Show that A*w is actually a linear functional on V, and 
that A* is a linear map. 


Proposition 4.2. The dual map satisfies the following properties. 
(a) (Ao B)* = B*o A*. 
(b) Id*: V* > V* is the identity map of V*. 
Exercise 4.3. Prove the preceding proposition. 


(For those who are familiar with the language of category theory, this 
proposition can be summarized by saying that the assignment that sends 
a vector space to its dual space and a linear map to its dual map is a 
contravariant functor from the category of real vector spaces to itself. See, 
for example, [Lee00, Chapter 7].) 

Aside from the fact that the dimension of V* is the same as that of V, 
the next most important fact about the dual space is the following. 


Proposition 4.3. Let V be a finite-dimensional vector space. There is a 
canonical (basis-independent) isomorphism between V and its second dual 


space V** = (V*)*. 


Proof. Given a vector X € V, define a linear functional X on V* by 


X(w)=w(X), forweV*. 


It is easy to check that X(w) depends linearly on w, so that X € V™, 
and that the map X t> X is linear from V to V**. To show that it is an 
isomorphism, it suffices for dimensional reasons to verify that it is injective. 
Suppose X € V is not zero. Extend X to a basis (X = Fj,..., F,) for V, 
and let (e',...,€”) denote the dual basis for V*. Then 


X(e") = e*'(X) = e'(F1) = 140, 


so X £0. 
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Because of this proposition, the real number w(X) obtained by applying 
a covector w to a vector X is sometimes denoted by either of the notations 
(w, X) or (X,w); both expressions can be thought of either as the action of 
the covector w € V* on the vector X € V, or as the action of the covector 
X € V™ on the element w € V*. 

There is also a symmetry between bases and dual bases of a finite- 
dimensional vector space V: Any basis for V determines a dual basis for 
V*, and conversely any basis for V* determines a dual basis for V** = V. 
It is easy to check that if (c’) is the basis for V* dual to a basis (£;) for 
V, then (F;) is the basis dual to (e"). 


Tangent Covectors on Manifolds 


Now let M be a smooth manifold. For each p € M, we define the cotangent 
space at p, denoted by TM, to be the dual space to T,M: 


T*M = (TpM)*. 


Elements of TM are called tangent covectors at p, or just covectors at p. 
If (a) are local coordinates on an open subset U C M, then for each 

p € U, the coordinate basis (0/0x"|p) gives rise to a dual basis (¢%,). Any 

covector € € TM can thus be written uniquely as € = &¢),, where 


a) 
ene( 2 p 


Suppose now that (Z/) are another set of coordinates whose domain over- 
laps U, and let (€,) denote the basis for TM dual to (0/0%1|,). We can 
compute the components of the same covector € with respect to the new co- 
ordinate system as follows. First observe that the computations in Chapter 
3 show that the coordinate vector fields transform as follows: 


a _ oF p) 
@x'| Oat © Ogi 


Pp 


(4.2) 


Pp 


Writing € in both systems as 
C= te = te, 


we can use (4.2) to compute the components €; in terms of &;: 


fe) Ox ) 
Se (2 ) — (Fw Ox P 


The Cotangent Bundle 69 


As we mentioned in Chapter 3, in the early days of smooth manifold 
theory, before most of the abstract coordinate-free definitions we are using 
were developed, mathematicians tended to think of a tangent vector at a 
point p as an assignment of an n-tuple of real numbers to each coordinate 
system, with the property that the n-tuples (X!,...,X") and (X!,...,X”) 
assigned to two different coordinate systems (x’) and (Z/) were related by 
the transformation law that we derived in Chapter 3: 

Ox) 


Xi= a (p) X?. 


Similarly, a tangent covector was thought of as an n-tuple (£1,...,&,) that 
transforms, by virtue of (4.3), according to the following slightly different 
rule: 

_ OF x 


= 0%. (4.4) 


Since the transformation law (4.2) for the coordinate partial derivatives 
follows directly from the chain rule, it can be thought of as fundamental. 
Thus it became customary to call tangent covectors covariant vectors be- 
cause their components transform in the same way as (“vary with”) the 
coordinate partial derivatives, with the Jacobian matrix (Ox/ /Ox*) multi- 
plying the objects associated with the “new” coordinates (z/) to obtain 
those associated with the “old” coordinates (x*). Analogously, tangent vec- 
tors were called contravariant vectors, because their components transform 
in the opposite way. (Remember, it was the component n-tuples that were 
thought of as the objects of interest.) Admittedly, it does not make a lot of 
sense, but by now the terms are well entrenched, and we will see them again 
in Chapter 8. Note that this use of the terms covariant and contravariant 
has nothing to do with the covariant and contravariant functors of category 
theory. 


bi 


The Cotangent Bundle 


The disjoint union 
T*M= || TM 
pEM 
is called the cotangent bundle of M. 


Proposition 4.4. The cotangent bundle of a smooth manifold has a nat- 
ural structure as a vector bundle of rank n over M. 


Proof. The proof is essentially the same as the one we gave for the tangent 
bundle. Let 7: T*M — M be the natural projection that sends € € T; M 
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to p € M. Given a coordinate chart (U,(x‘)) on M, we define a chart 
®: r—1(U) — R” by 


®(Eie4) = (2"(p), ae , a” (p), Et, tee »En); 


where (x!(p),...,2"(p)) is the coordinate representation of p € M, and (¢’,) 
is the dual coordinate basis for TM. (In this situation, we must forego our 
insistence that coordinate functions have upper indices, because the fiber 
coordinates €; are already required by our index conventions to have lower 
indices. ) 

If two charts (U,(a*)) and (U, ()) overlap, then clearly % are smooth 
functions of (x',...,2"), and (4.3) can be solved for é; (by invert- 
ing the Jacobian matrix) to show that &; is a smooth function of 
(xt,...,2",&1,...,€). The arguments of Lemmas 3.12 and 3.13 then ap- 
ply almost verbatim to give T*M the structure of a smooth vector bundle 
over M. 


A section of T*M is called a covector field on M. As we did with vector 
fields, we will write the value of a covector field o at a point p € M as 
op instead of o(p), to avoid conflict with the notation for the action of 
a covector on a vector. If (ef) is the dual coordinate basis for T,M at 
each point p in some open set U Cc M, then o can be expressed locally as 
Op = 0% (pet, for some functions 01,...,0,: U — R, called the component 
functions of o. 

A covector field is said to be smooth if it is smooth as a map from M to 
T*M. Smooth covector fields are called (differential) 1-forms. (The reason 
for the latter terminology will become clear in Chapter 9, when we define 
differential k-forms for k > 1.) 

Just as in the case of vector fields, there are several ways to check for 
smoothness of a covector field. The proof is quite similar to the proof of 
the analogous fact for vector fields (Lemma 3.14). 


Lemma 4.5. Let M be a smooth manifold, and let o: M — T*M be a 
map (not assumed to be continuous) such that op € TM for each pe M. 
The following are equivalent. 


(a) o is smooth. 
(b) In any coordinate chart, the component functions o; of o are smooth. 


(c) If X is a smooth vector field defined on any open subset U C M, then 
the function (0, X): U > R, defined by 


(0, X)(p) = (op, Xp) = Fp(Xp) 
is smooth. 


Exercise 4.4. Prove Lemma 4.5. 
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An ordered n-tuple of smooth covector fields (o1,...,0”) defined on some 
open set U C M is called a local coframe on U if (04) forms a basis for 
TM at each point p € U. If U = M, it is called a global coframe. (A local 
coframe is just a local frame for T*M, in the terminology introduced in 
Chapter 3.) Given a local frame (F,...,£,) for TM over an open set U, 
there is a uniquely determined smooth local coframe (<1,...,€”) satisfying 
e'(E;) = 6%. It is smooth by part (c) of the preceding lemma. This coframe 
is called the dual coframe to the given frame. 

Note that Lemma 4.5(b) implies, in particular, that each coordinate cov- 
ector field e' (whose value at p is the ith dual basis element ¢7,) is smooth, 
because each of its component functions is either identically 1 or identically 
0. Thus (e!,...,€”) is a smooth local coframe on the coordinate domain, 
called a coordinate coframe. 

We denote the set of all smooth covector fields on M by J*(M). Given 
smooth covector fields o,7 € J*(M), any linear combination ao + br with 
real coefficients is obviously again a smooth covector field, so T*(/) is a 
vector space. Moreover, just like vector fields, covector fields can be mul- 
tiplied by smooth functions: If f € C°(M) and o € J*(M), we define a 
covector field fo by 


(fo)p = f(P)op- (4.5) 


A simple verification using either part (b) or part (c) of Lemma 4.5 shows 
that fo is smooth. Thus J*(1/) is a module over C™(M). 

Geometrically, we think of a vector field on M as a rule that attaches 
an arrow to each point of M. What kind of geometric picture can we 
form of a covector field? The key idea is that a nonzero linear functional 
€ € TM is completely determined by two pieces of data: its kernel, which 
is a codimension-1 linear subspace of T,M (a hyperplane), and the set of 
vectors X for which €(X) = 1, which is an affine hyperplane parallel to 
the kernel. (Actually, the set where €(X) = 1 alone suffices, but it is useful 
to visualize the two parallel hyperplanes.) The value of €(X) for any other 
vector X is then obtained by linear interpolation or extrapolation. 

Thus you can visualize a covector field as defining a pair of affine hyper- 
planes in each tangent space, one through the origin and another parallel to 
it, and varying smoothly from point to point. At points where the covector 
field takes on the value zero, one of the hyperplanes goes off to infinity. 


The Differential of a Function 


In elementary calculus, the gradient of a smooth function f on R” is defined 
as the vector field whose components are the partial derivatives of f. Un- 
fortunately, in this form, the gradient does not make coordinate-invariant 
sense. 
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Exercise 4.5. Let f(x,y) =x on R?, and let X be the vector field 
) 
X = grad f= —. 
grad f an 


Compute the coordinate expression of X in polar coordinates (on some open 
set on which they are defined) using (4.2) and show that it is not equal to 


ata , af a 
Or Or — 00 00° 

The most important use of covector fields is to define a coordinate- 
invariant analogue of the gradient. 

Let f be a smooth function on a manifold M. (As usual, all of this 
discussion applies to functions defined on an open subset U C M, simply 
by replacing M by U throughout.) We define a covector field df, called the 
differential of f, by 


df,(Xp) =Xpf for Xp € TpM. 


Lemma 4.6. The differential of a smooth function is a smooth covector 


field. 


Proof. First we need to verify that at each point p € M, df,(X,) depends 
linearly on X,, so that df, is indeed a covector at p. This is a simple 
computation: For any a,b € R and X,, Y, € T,>M, 


dfp(aXp + bY) 
= (aX, + bY,) f = a(Xpf) + (Ypf) = adfp(Xp) + bdfp(Y>). 


Next we show that df is smooth. Let (2*) be local coordinates on an open 
subset U Cc M, and let (e’) be the corresponding coordinate coframe on U. 
Writing df in coordinates as dfy = A;(p)é;, for some functions A;: U > R, 
the definition of df implies 

a) 
Gi Ox* 


Since this last expression depends smoothly on p, it follows that the com- 
ponent functions A; of df are smooth, so df is smooth. 


0 
Ox? 


f= oF ip) 


Pp 


Ai(p) = dfp ( 


One consequence of the preceding proof is a formula for the coordinate 
representation of df: 


_ Of 
~~ Oat 


df (p)ep- (4.6) 


Thus the components of df in any coordinate system are the partial deriv- 
atives of f with respect to those coordinates. Because of this, we can think 
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of df as an analogue of the classical gradient, reinterpreted in a way that 
makes coordinate-invariant sense on a manifold. 

If we apply (4.6) to the special case in which f is one of the coordinate 
functions 77: U > R, we find 


dal, = (p)e, = 5%, =e. 


In other words, the coordinate covector field € is none other than dx?! 
Therefore, the formula (4.6) for df, can be rewritten as 


of 


dfp = Fqi Pdap, 


or as an equation between covector fields instead of covectors: 


af= oF -dx’. (4.7) 
Ox* 
In particular, in the one-dimensional case, this reduces to 
d, 
df = oie 
dx 


Thus we have recovered the familiar classical expression for the differential 
of a function f in coordinates. Henceforth, we will abandon the notation 
e’ for the coordinate coframe, and use dx’ instead. 


Example 4.7. If f(x,y) = 2?ycosz on R?, then df is given by the formula 
O(x7y cos x) O(x?y cos x) 
df = 
Ox Oy 
= (2ry cosa — x*ysinax)dx + x cos x dy. 


dx 4 


dy 


Proposition 4.8 (Properties of the Differential). Let M be a 
smooth manifold, and let f,g € C°(M). 


For any constants a,b, d(af + bg) = adf + bdg. 
d(fg) = fdg + gdf. 
d(f/g) =(g df — f dg)/g? on the set where g # 0. 


If J C R ts an interval containing the image of f, and h: J — R is 
a smooth function, then d(ho f) = (h'o f) df. 


(e) If f is constant, then df = 0. 


Exercise 4.6. Prove Proposition 4.8. 
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One very important property of the differential is the following charac- 
terization of smooth functions with vanishing differentials. 


Proposition 4.9 (Functions with Vanishing Differentials). If f is 
a smooth function on a smooth manifold M, then df = 0 if and only if f 
is constant on each component of M. 


Proof. It suffices to assume that M is connected and show that df = 0 if 
and only if f is constant. One direction is immediate: If f is constant, then 
df = 0 by Proposition 4.8(e). Conversely, suppose df = 0, let p € M, and 
let C= {qe M: f(g) = f(p)}. If q is any point in C, let U be a connected 
coordinate domain centered at g. From (4.7) we see that Of /Ox' = 0 in U 
for each 7, so by elementary calculus f is constant on U. This shows that 
€ is open, and since it is closed by continuity it must be all of M. Thus f 
is everywhere equal to the constant f(p). 


In elementary calculus, one thinks of df as an approximation for the 
small change in the value of f caused by small changes in the independent 
variables x’. In our present context, df has the same meaning, provided we 
interpret everything appropriately. Suppose that f is defined and smooth 
on an open subset U C R”, and let p be a point in U. Recall that dz’, is 
the linear functional that picks out the 7th component of a tangent vector 
at p. Writing Af = f(p+v) — f(p) for v € R”, Taylor’s theorem shows 
that Af is well approximated when v is small by 


i of (p)dax,,(v) = dfp(v). 
In other words, dfp is the linear functional that best approximates Af near 
p. The great power of the concept of the differential comes from the facts 
that we can define df invariantly on any manifold, and can do so without 
resorting to any vague arguments involving infinitesimals. 

The next result is an analogue of Proposition 3.11 for the differential. 


Proposition 4.10. Suppose y: J — M is a smooth curve and f: M — 
R is a smooth function. Then the derivative of the real-valued function 
foy: RR its given by 


(f07)'(t) = df (y'"(). (4.8) 
Proof. Directly from the definitions, for any to € J, 
df(t) (Y ‘(to)) = (to) f (definition of df) 
= ( Ya a. )f (definition of y’(t)) 
dt |, 

d ne ; 
=%—| (fo) (definition of +,.) 

dt |, 
= (fo7)'(to) (definition of (f 0 7)’), 
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which was to be proved. 


It is important to observe that for a smooth real-valued function f: M —- 
R, we have now defined two different kinds of derivative of f at a point 
p € M. In the preceding chapter, we defined the push-forward f, as a linear 
map from TM to T(p)R. In this chapter, we defined the differential dfp as a 
covector at p, which is to say a linear map from T;,M to R. These are really 
the same object, once we take into account the canonical identification 
between R and its tangent space at any point; one easy way to see this is 
to note that both are represented in coordinates by the row matrix whose 
components are the partial derivatives of f. 

Similarly, if 7 is a smooth curve in M, we have two different meanings 
for the expression (f o y)/(t). On the one hand, f oy can be interpreted 
as a smooth curve in R, and thus (f o 7)’(t) is its tangent vector at the 
point fo y(t), an element of the tangent space T’p,.(¢)R. Proposition 3.11 
shows that this tangent vector is equal to f.(7/(t)). On the other hand, 
fo7 can also be considered simply as a real-valued function of one real 
variable, and then (f o7)’(t) is just its ordinary derivative. Proposition 4.10 
shows that this derivative is equal to the real number df,(¢)(y'(t)). Which 
of these interpretations we choose will depend on the purpose we have in 
mind. 


Pullbacks 


As we have seen, a smooth map yields a linear map on tangent vectors 
called the push-forward. Dualizing this leads to a linear map on covectors 
going in the opposite direction. 

Let F: M — N be a smooth map, and let p € M be arbitrary. The 
push-forward map 


Fy: TypM > Trea) N 
yields a dual map 
(F.)": TiN > To M. 


To avoid a proliferation of stars, we write this map, called the pullback 
associated with F’, as 


Unraveling the definitions, F* is characterized by 


(F*é)(X) =€(F.X), for €€ TiN, X € TM. 
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When we introduced the push-forward map, we made a point of noting 
that vector fields do not push forward to vector fields, except in the special 
case of a diffeomorphism. The surprising thing about pullbacks is that they 
always pull smooth covector fields back to smooth covector fields. Given 
a smooth map G: M — N and a smooth covector field o on N, define a 
covector field G*o on M by 


(G"0)p = G* (cay): (4.9) 


Observe that there is no ambiguity here about what point to pull back 
from, in contrast to the vector field case. We will prove in Proposition 4.12 
below that G*o is smooth. Before doing so, let us examine two important 
special cases. 


Lemma 4.11. Let G: M — N be a smooth map, and suppose f € C®(N) 
anda € J*(N). Then 


G*df = d(f 0G); (4.10) 
G* (fo) = (f oG)G*o. (4.11) 


Proof. To prove (4.10), we let X, € T,M be arbitrary, and compute 


(G*df) (Xp) = (G*feqy)(Xp) (by (4.9)) 
= dfg(p)(G+Xp) (by definition of G*) 
Bee Or (by definition of df) 
= X,(f 0G) (by definition of G..) 
= d(f oG)p(Xp) (by definition of d(f o G)). 


Similarly, for (4.11), we compute 


cheers = G*((fo)ewp) (by (4.9)) 
G*(f(G(P))eaw@)) (by (4.5)) 

f(G(p))G* (a(n) (because G* is linear) 
f(G(P))(G*o)p (by (4.9)) 
= 2s G)G"o)p (by (4.5)), 


which was to be proved. 


Proposition 4.12. Suppose G: M — N is smooth, and let o be a smooth 
covector field on N. Then G*o is a smooth covector field on M. 


Proof. Let p € M be arbitrary, and choose local coordinates (x') for M 
near p and (y/) for N near G(p). Writing o in coordinates as 0 = oj;dy! 
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for smooth functions o; defined near G(p) and using Lemma 4.11 twice, we 
compute 
G*o = G*(o;dy’) 
= (0; 0 G)G* dy’ 
= (7; 0 G)d(y’ 0G). 


Because this expression is smooth, it follows that G*o is smooth. 


In the course of the preceding proof, we derived the following formula 
for the pullback of a covector field with respect to coordinates (x) on the 
domain and (y’) on the range: 


G*a = G* (aj;dy’) = (0; 0 G)d(y! o G) = (0; 0 G)dG!, (4.12) 


where G is the jth component function of G in these coordinates. This 
formula makes the computation of pullbacks in coordinates exceedingly 
simple, as the next example shows. 


Example 4.13. Let G: R? — R? be the map given by 
(u,0) = G(e,¥, 2) = (ay, ysin2), 
and let ¢ € J*(R?) be the covector field 
og =udv+vdu. 
According to (4.12), the pullback G*o is given by 


Go = (u ° G)d(vu ° G) + (v ° G)d(u io) G) 
= (x*y)d(ysin z) + (ysin z)d(x*y) 
= x y(sin z dy + ycos z dz) + ysin z(2ay dx + x? dy) 


= 2xy sin z dx + 2x? ysin z dy + 27 y? cos z dz. 


In other words, to compute G*o, all you need to do is substitute the 
component functions of G for the coordinate functions of N everywhere 
they appear in o! 

This also yields an easy way to remember the transformation law for a 
covector field under a change of coordinates. Again, an example will convey 
the idea better than a general formula. 


Example 4.14. Let (r,@) be polar coordinates on, say, the upper half- 
plane H = {(x,y) : y > O}. We can think of the change of coordinates 
(x,y) = (rcos6@,rsin@) as the coordinate expression for the identity map 
of H, but using (r,@) as coordinates for the domain and (2,y) for the 
range. Then the the pullback formula (4.12) tells us that we can compute 
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the polar coordinate expression for a covector field simply by substituting 
x=rcosé, y=rsin9@. For example, 


xdx + y dy = 1d* (ada + ydy) 
= (rcos@)d(r cos 6) + (r sin 0)d(r sin @) 
= (rcos@)(cos 0 dr — rsin @d@) + (r sin 0)(sin 6 dr + r cos 0 d6) 
= (rcos? @ + rsin? 0)dr + (—r? cos 6 sin @ + r? sin 6 cos 0)d0 


=rdr. 


Line Integrals 


Another important application of covector fields is to make coordinate- 
independent sense of the notion of a line integral. 

We begin with the simplest case: an interval in the real line. Suppose 
[a,b] C R is a compact interval, and w is a smooth covector field on {a, 6]. 
(This means that the component function of w admits a smooth extension 
to some neighborhood of [a, b].) If we let ¢ denote the standard coordinate 
on R, w can be written w, = f(t) dt for some smooth function f: [a,b] > 
R. The similarity between this and the standard notation { f(t) dt for an 
integral suggests that there might be a connection between covector fields 
and integrals, and indeed there is. We define the integral of w over [a,b] to 


be 
i = i. " f(t) de 


The next proposition indicates that this is more than just a trick of nota- 
tion. 


Proposition 4.15 (Diffeomorphism Invariance of the Integral). 
Let w be a smooth covector field on the compact interval [a,b] C R. If 
y: [c,d] — [a,b] is an increasing diffeomorphism (meaning that t < t’ 


implies p(t) < y(t’)), then 
‘i pw =} Ww. 
[c,d] [a,b] 


Proof. If we let s denote the standard coordinate on [c,d] and ¢ that on 
[a, b], then (4.12) shows that the pullback y*w has the coordinate expres- 
sion (y*w), = f(y(s))y’(s) ds. Inserting this into the definition of the line 
integral and using the change of variables formula for ordinary integrals, 
we obtain 


hes gun [Keyes is= f 1 af Bs 
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which was to be proved. 


Exercise 4.7. If y: [c,d] — [a,b] is a decreasing diffeomorphism, show 
that Sie.al pw =— Sia.b) Ww. 


Now let M be a smooth manifold. By a curve segment in M we mean 
a continuous curve y: [a,b] — M whose domain is a compact interval. 
It is a smooth curve segment if it is has a smooth extension to an open 
set containing [a,b]. A piecewise smooth curve segment is a curve segment 
y: [a,b] — M with the property that there exists a finite subdivision 
a= a9 < a <-++: < ay = 6 of [a,b] such that ¥\[c,_,,a,] is smooth for 
each i. Continuity of y means that 7(¢) approaches the same value as t 
approaches any of the points a; (other than ap or ax) from the left or the 
right. Smoothness of 7 on each subinterval means that y has one-sided tan- 
gent vectors at each such a; when approaching from the left or the right, 
but these one-sided tangent vectors need not be equal. 


Lemma 4.16. If M is a connected smooth manifold, any two points of M 
can be joined by a piecewise smooth curve segment. 


Proof. Let p be an arbitrary point of M, and define a subset C C M 
by € = {q € M : there is a piecewise smooth curve in M from p to gq}. 
Clearly p € €, so © is nonempty. To show € = M, we need to show it is 
open and closed. 

Let q € © be arbitrary, which means that there is a piecewise smooth 
curve segment y going from p to q. Let U be a coordinate ball centered at q. 
If q’ is any point in U, then it is easy to construct a piecewise smooth curve 
segment from p to q’ by first following y from p to q, and then following 
a straight-line path in coordinates from gq to q’. Thus U C @, which shows 
that € is open. On the other hand, if qg € OC, let U be a coordinate ball 
around gq as above. The fact that qg is a boundary point of C means that 
there is some point q’ € CN U. In this case, we can construct a piecewise 
smooth curve from p to q by first following one from p to q’ and then 
following a straight-line path in coordinates from q’ to q. This shows that 
gq € @, so @ is also closed. 


If y is asmooth curve segment in M and w is a smooth covector field on 
M, we define the line integral of w over y to be the real number 


feo-f yw. 
¥ [a,b] 


Because y*w is a smooth covector field on [a, b], this definition makes sense. 
More generally, if 7 is piecewise smooth, we define 


[e- 


k 


Pe / yw, 
[ai-1,a:] 


i=1 
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where [aj-1,@;], i =1,...,k, are the intervals on which y is smooth. 

This definition gives a rigorous meaning to classical line integrals such 
as Jf, Pdet+ Q dy in the plane or [, Pda + Qdy+ Rdz in R®. 
Proposition 4.17 (Properties of Line Integrals). Let M be a 
smooth manifold. Suppose y: [a,b] — M is a piecewise smooth curve 


segment and w,w1,w2 € T*(M). 


(a) For any c1,c2 € R, 


[lever + exw) =o f witer fu. 
7 + + 


(b) If y is a constant map, then fw =0. 


(c) Ifa<c<b, then 


where 71 = Y\{a,q and Y2 = Y\I[c,0}- 


(d) The line integral of w over y can also be expressed as the ordinary 


integral 
b 
fo-] wy¢ay (y(t) at. 
Y a 


Exercise 4.8. Prove Proposition 4.17. 


Example 4.18. Let M = R?~ {0}, let w be the covector field on M given 
by 


_ ady—ydz 

_ ge + y? ’ 

and let y: [0,27] — M be the curve segment defined by 
+(t) = (cos t, sint). 


Since y*w can be computed by substituting « = cost and y = sint every- 
where in the formula for w, we find that 


t(cos t dt) — sin t(— sin t dt se 
7 [0,27] sin’ t+ cos? t ) 
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One of the most significant features of line integrals is that they are in- 
dependent of parametrization, in a sense we now define. If y: [a,b] ~ M 
and ¥: [c,d] > M are smooth curve segments, we say that ¥ is a reparam- 
etrization of y if ¥ = yo » for some diffeomorphism y: [c,d] — [a,b]. If y 
is an increasing function, we say ¥ is a forward reparametrization, and if y 
is decreasing, it is a backward reparametrization. (More generally, one can 
allow y to be piecewise smooth, but we will have no need for this extra 
generalization.) 


Proposition 4.19 (Parameter Independence of Line Integrals). 
Suppose M is a smooth manifold, w is a smooth covector field on M, and 
7 is a piecewise smooth curve segment in M. For any reparametrization ¥ 
of y, we have 


[o- 


Proof. First assume that 7: [a,b] — M is smooth, and suppose ¢: [c, d] > 
(a, b] is an increasing diffeomorphism. Then Proposition 4.15 implies 


/ w ify is a forward reparametrization, 
7 


- / w ify is a backward reparametrization. 
7 


When y is decreasing, the analogous result follows from Exercise 4.7. If ¥ is 
only piecewise smooth, the result follows simply by applying the preceding 
argument on each subinterval where y is smooth. 


Exercise 4.9. Suppose F': M — N is any smooth map, w € J*(N), and 
7 is a piecewise smooth curve segment in M. Show that 


fre-f Ww. 
pi Foy 


There is one special case in which a line integral is trivial to compute: 
the line integral of a differential. 


Theorem 4.20 (Fundamental Theorem for Line Integrals). Let 
M be a smooth manifold. Suppose f is a smooth function on M and 
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y: [a,b] + M is a piecewise smooth curve segment in M. Then 
[ #=10)- fo@). 
m 


Proof. Suppose first that 7 is smooth. By Proposition 4.10 and Proposition 
4.17(d), 


[a= [ too'ma= [rove dt. 


By the one-variable version of the fundamental theorem of calculus, this is 
equal to fo y(b) — fo y(a). 

If y is merely piecewise smooth, let a = ag < --- < ax = b be the 
endpoints of the subintervals on which 7 is smooth. Applying the above 
argument on each subinterval and summing, we find that 


k 
[= (60) = fla) = Fo) - fla), 


because the contributions from all the interior points cancel. 


Conservative Covector Fields 


Theorem 4.20 shows that the line integral of any covector field w that can be 
written as the differential of a smooth function can be computed extremely 
easily once the smooth function is known. For this reason, there is a special 
term for covector fields with this property. We say a smooth covector field w 
on amanifold M is exact on M if there is a function f € C°(M) such that 
w = df. In this case, the function f is called a potential for w. The potential 
is not uniquely determined, but by Lemma 4.9, the difference between any 
two potentials for w must be constant on each component of M. 

Because exact differentials are so easy to integrate, it is important to 
develop criteria for deciding whether a covector field is exact. Theorem 
4.20 provides an important clue. It shows that the line integral of an exact 
covector field depends only on the endpoints p = y(a) and g = 7(b): Any 
other curve segment from p to q would give the same value for the line 
integral. In particular, if 7 is a closed curve segment, meaning that y(a) = 
4(b), then the integral of df over y is zero. 

We say a smooth covector field w is conservative if the line integral of w 
over any closed piecewise smooth curve segment is zero. This terminology 
comes from physics, where a force field is called conservative if the change 
in energy caused by the force acting along any closed path is zero (“energy 
is conserved” ). (In elementary physics, force fields are usually thought of as 
vector fields rather than covector fields; see Problem 4-5 for the connection.) 
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The following lemma gives a useful alternative characterization of con- 
servative covector fields. 


Lemma 4.21. A smooth covector field w is conservative if and only if the 
line integral of w depends only on the endpoints of the curve, 1.e., fw = 
iw whenever y and ¥ are piecewise smooth curve segments with the same 
starting and ending points. 


Exercise 4.10. Prove Lemma 4.21. [Observe that this would be much 
harder to prove if we defined conservative fields in terms of smooth curves 
instead of piecewise smooth ones.] 


Theorem 4.22. A smooth covector field is conservative if and only if it is 
exact. 


Proof. Ifw € J*(M) is exact, Theorem 4.20 shows that it is conservative, so 
we need only prove the converse. Suppose therefore that w is conservative, 
and assume for the moment that M is connected. Because the line integrals 
of w are path independent, we can adopt the following notation: For any 
points p,q € M, we will use the notation Spe to denote the line integral 
ake w, where y is any piecewise smooth curve segment from p to g. Observe 
that Proposition 4.17(c) implies that 


P2 P3 P3 
/ w + / w= / w (4.13) 
PL p2 Pl 


for any three points pi, p2,p3 € M. 
Now choose any base point po € M, and define a function f: M — R by 


fa= fw 


We will show that df = w. To accomplish this, let gg € M be an arbitrary 
point, let (U,(x’)) be a coordinate chart centered at go, and write the 
coordinate representation of w as w = w;dx'. We will show that 


oF (a) = Wi (qo) 


for 7 =1,...,n, which implies that dfg, = way. 

Fix j, and let 7: [—¢,e] ~ U be the smooth curve segment defined in 
coordinates by y(t) = (0,...,¢,...,0), with ¢ in the jth place, and with ¢ 
chosen small enough that y[—¢,¢] C U. Let p; = y(—e), and define a new 
function f: M => R by f(q) = i w. Note that (4.13) implies 


fa -sa= fw [w= fw 
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which does not depend on q. Thus fand f differ by a constant, so it suffices 
to show that Of /Ox! (qo) = w; (qo). 
Now y‘(t) = 0/02/|() by construction, so 


en") = ( 35 


Since the restriction of y to [—<,t] is a smooth curve from p; to y(t), we 
have 


Thus by the fundamental theorem of calculus, 


SF (40) = (O)F 
A 7 
gery moeee y(t) 
d t 
Pie - [ sttenes 
= w;(7(0)) = wj (40). 


This completes the proof that df = w. 

Finally, if 7 is not connected, let {14;} be the components of M. The 
argument above shows that for each i there is a smooth function f; © 
C™(M;) such that df; = w on M;. Letting f: M@ — R be the function that 
is equal to f; on M;, we have df = w, thus completing the proof. 


It would be nice if every smooth covector field were exact, for then the 
evaluation of any line integral would just be a matter of finding a potential 
function and evaluating it at the endpoints, a process analogous to eval- 
uating an ordinary integral by finding an indefinite integral or primitive. 
However, this is too much to hope for. 


Example 4.23. The covector field w of Example 4.18 cannot be exact on 
R? \ {0}, because it is not conservative: The computation in that example 
showed that fw = 2a # 0, where ¥ is the unit circle traversed counter- 
clockwise. 


Because exactness has such important consequences for the evaluation 
of line integrals, we would like to have an easy way to check whether a 
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given covector field is exact. Fortunately, there is a very simple necessary 
condition, which follows from the fact that partial derivatives of smooth 
functions can be taken in any order. 

To see what this condition is, suppose that w is exact. Let f be any 
potential function for w, and let (U,(a’)) be any coordinate chart on M. 
Because f is smooth, it satisfies the following identity on U: 


O7f 7 07 f 
OxtOxi ~~ OxI Oat’ 


(4.14) 


Writing w = wid in coordinates, the fact that w = df is equivalent to 
w; = Of /Ozx'. Substituting this into (4.14), we find that the component 
functions of w satisfy 


Ox? Oaxd 


(4.15) 


We say that a smooth covector field w is closed if its components in 
every coordinate chart satisfy (4.15). The following lemma summarizes the 
computation above. 


Lemma 4.24. Every exact covector field is closed. 


The significance of this result is that the property of being closed is one 
that can be easily checked. First we need the following result, which says 
that it is not necessary to check the closedness condition in every coordinate 
chart, just in a collection of charts that cover the manifold. The proof of this 
lemma is a tedious computation; later you will be able to give a somewhat 
more conceptual proof (see Problem 4-4 and also Chapter 9), so you are 
free to skip the proof of this lemma if you wish. 


Lemma 4.25. A smooth covector field is closed if and only if it satisfies 
(4.15) in some coordinate chart around every point. 


Proof. If w is closed, then by definition it satisfies (4.15) in every coordi- 
nate chart. Conversely, suppose (4.15) holds in some chart around every 
point, and let (U,(z*)) be an arbitrary coordinate chart. For each p € U, 
the hypothesis guarantees that there are some coordinates (%/) defined 
near p in which the analogue of (4.15) holds. Using formula (4.4) for the 
transformation of the components of w together with the chain rule, we 
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find 
Ow; = Ow; = 0 Or" 0 Ox" 
axi Ox’ ~~ Oa \ Oxi” ax? \ Ox7 ~* 
= Cea re Oz" OB, Oeg* a OE" OD, 
~ \ Oriani” Oat Oxi axiaai *T Oxi Axi 


Oak awk ag! ab, = ?* _ OF aa! OD, 
~ Balai" Oat dai Oe Axial" ~ Bai Oat OF! 
(Pa PRE \ ARK AF (AD, — AB 
i (S50 sama a Ox Jad (3 oa 
= O46, 


where the fourth equation follows from the third by interchanging the roles 
of k and J in the last term. 


For example, consider the following covector field on R?: 
w= ycos ry dx + xcos ry dy. 
It is easy to check that 


ra) O(x cos 
(y cos vy) _ (x xy) =cosry — rysin zy, 


Oy Ox 


so w is closed. In fact, you might guess that w = d(sin xy). 

The question then naturally arises whether the converse of Lemma 4.24 is 
true: is every closed covector field exact? The answer is almost yes, but there 
is an important restriction. It turns out that the answer to the question 
depends in a subtle way on the shape of the domain, as the next example 
illustrates. 


Example 4.26. Look once again at the covector field w of Example 4.7. 
A straightforward computation shows that w is closed; but as we observed 
above, it is not exact on R? \ {0}. On the other hand, if we restrict the 
domain to the right half-plane U = {(a,y) : > 0}, a computation shows 
that w = d(tan~! y/x) there. This can be seen more clearly in polar coordi- 
nates, where w = d@. The problem, of course, is that there is no smooth (or 
even continuous) angle function on all of R? \ {0}, which is a consequence 
of the “hole” in the center. 


This last example illustrates a key fact: The question of whether a par- 
ticular covector field is exact is a global one, depending on the shape of the 
domain in question. This observation is the starting point for de Rham co- 
homology, which expresses a deep relationship between smooth structures 
and topology. We will pursue this relationship in more depth in Chapter 
11, but for now we can prove the following result. A subset V C R” is said 
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to be star-shaped with respect to a point c € V if for every x € V, the 
line segment from c to z is entirely contained in V. For example, a convex 
subset is star-shaped with respect to each of its points. 


Proposition 4.27. If M is diffeomorphic to a star-shaped open subset of 
R”, then every closed covector field on M is exact. 


Proof. It is easy to check that a diffeomorphism pulls back closed covector 
fields to closed covector fields and exact covector fields to exact ones; thus 
it suffices to prove the proposition when WM actually is a star-shaped open 
subset of R”. So suppose M C R” is star-shaped with respect to c € M, 
and let w = w,;dz' be a closed covector field on M. 

As in the proof of Theorem 4.22, we will construct a potential function 
for w by integrating along smooth curve segments from c. However, in this 
case we do not know a priori that the line integrals are path-independent, 
so we must integrate along specific paths. 

For any point x € M, let y.: [0,1] — M denote the line segment from c 
to x, parametrized as follows: 


a(t) =c+t(x—c). 


The hypothesis guarantees that the image of 7z lies entirely in M for each 
x € M. Define a function f: M — R by 


(0) = fw. 


© 


We will show that f is a potential for w, or equivalently that Of /Ox* = w; 
fori =1,...,n. To begin, we compute 


f(a) = | wy, (7 (t)) at 
=| En Ce Cm ec ney Fe 


To compute the partial derivatives of f, we note that the integrand is 
smooth in all variables, so it is permissible to differentiate under the integral 
sign to obtain 


5a) = (+5 (c+ t(a —¢))(a2* -—c) +w;(c+ t(a c)))at 
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Because w is closed, this reduces to 


i) -[ (Fete eo =e) esa c)))at 


a f © (tole + t(x — c))) dt 


= & (c+ (a — ) 


= w; (2), 


t=0 


which was to be proved. 


The key to the above construction is that we can reach every point 
xz € M by a definite path y, from c to x, chosen in such a way that yz 
varies smoothly as x varies. That is what fails in the case of the closed 
covector field w on the punctured plane (Example 4.23): Because of the 
hole, it is impossible to choose a smoothly-varying family of paths starting 
at a fixed base point and reaching every point of the domain. In Chapter 
11, we will generalize Proposition 4.27 to show that every closed covector 
field is exact on any simply connected manifold. 

When you actually have to compute a potential function for a given 
covector field that is known to be exact, there is a much simpler procedure 
that almost always works. Rather than describe it in complete generality, 
we illustrate it with an example. 


Example 4.28. Let w be a smooth covector field on R*, say 
2 2 
w =e dx + 2xye” dy — 2zdz. 


You can check that w is closed. If f is a potential for w, we must have 


Of _ y? 
de 
Of 2 
CF daa e¥ 
dy rye” , 
Of _ 

Aer ee 


Holding y and z fixed and integrating the first equation with respect to 2, 
we obtain 


Foua= / Pa BOGS, 
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where the “constant” of integration C\(y, z) may depend on the choice of 
(y, z). Now the second equation implies 


which forces 0C;/Oy = 0, so C; is actually a function of z only. Finally, 
the third equation implies 


a) 
-2z= 5 (re + C,(z)) 
_ C4 
S Oe 
from which we conclude that C\(z) = —2z? + C, where C is an arbitrary 


constant. Thus a potential function for w is given by f(x,y, z) = zed —22?. 
Any other potential differs from this one by a constant. 


You should convince yourself that the formal procedure we followed in 
this example is equivalent to choosing an arbitrary base point c € R®, 
and defining f(z,y,z) by integrating w along a path from c to (x,y, z) 
consisting of three straight line segments parallel to the axes. This works 
for any closed covector field defined on an open rectangle in R” (which we 
know must be exact, because a rectangle is convex). In practice, once a 
formula is found for f on some open rectangle, the same formula typically 
works for the entire domain. (This is because most of the covector fields 
for which one can explicitly compute the integrals as we did above are real- 
analytic, and real-analytic functions are determined by their behavior in 
any open set.) 
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Problems 


4-1. In each of the cases below, M is a smooth manifold and f: M—R 
is a smooth function. Compute the coordinate representation for df, 
and determine the set of all points p € M at which df, = 0. 


(a) M = {(z,y) € R?: « > 0}; f(a,y) = x/(x? +-y”). Use standard 
coordinates (2, y). 


(b) M and f areas in part (a); this time use polar coordinates (r, 6). 


(c) M=S? CR®; f(p) = z(p) (the z-coordinate of p, thought of as 
a point in R®). Use stereographic coordinates. 


(d) M=R"; f(x) = |z|?. Use standard coordinates. 
4-2. Let M be a smooth manifold. 


(a) Given a smooth covector field o on M, show that the map 
a: J(M) — C*(M) defined by 


o(X)(p) = op(Xp) 


is linear over C(I), in the sense that for any smooth functions 
f, f’ € C@(M) and smooth vector fields X,X’, 


a(fX + f'X') = fa(X) + fla(X’). 
(b) Show that a map 
a: T(M) > C™(M) 


is induced by a smooth covector field as above if and only if it 
is linear over C'™°(M). 


4-3. The length of a smooth curve y: [a,b] — R” is defined to be the value 
of the (ordinary) integral 


b 
L(y) = / Iy'(t)| at. 


Show that there is no smooth covector field w € J*(R") with the 
property that pe w = L(y) for every smooth curve y. 


4-4. Use Proposition 4.27 to give a simpler proof of Lemma 4.25. 


4-5. LINE INTEGRALS OF VECTOR FIELDS: Suppose X is a smooth vector 
field on an open set U C R”, thought of as a smooth function from 


4-6. 


4-8. 


4-9. 


4-10. 
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R” to R”. For any piecewise smooth curve segment y: [a,b] > U, 
define the line integral of X over y by 


[x ds = [ xow) -y'(t) dt, 


and say X is conservative if its line integral around any closed curve 
is zero. 


(a) Show that X is conservative if and only if there exists a smooth 
function f € C®(U) such that X = grad f. [Hint: Consider 
the covector field w,(Y) = X(x) -Y, where the dot denotes the 
Euclidean dot product.] 


(b) If n =3 and X is conservative, show curl X = 0, where 


Ox? @x?\ Od OX! Ax3\ A 
ee (= Ox? ) Ox! 3 (Fs Ox ) Ox? 
ROR, OR Vd 
(Fs Ox? ) Ox” 


(c) If U C R° is star-shaped, show that X is conservative on U if 
and only if curl X = 0. 


If M is a compact manifold and f € C°°(M), show that df vanishes 
somewhere on M. 


. Is there a smooth covector field on S? that vanishes at exactly one 


point? If so, can it be chosen to be exact? 


Let T” = S! x --- x S! denote the n-torus. For each i = 1,...,n, let 
7: [0,1] — T” be the curve segment 


yi(t) = (1,...,e?7,...,1) (with e?*” in the ith place), 


where we think of T” as a subset of C” & R?”. Show that a closed 


covector field w on T” is exact if and only if a w=Ofori=1,...,n. 
(Hint: Consider first E*w, where E: R" — T” is the covering map 
Bul gat) (ert a cet y | 


If F: M — N is a smooth map, show that F*: T*N — T*M is 
smooth. 


Consider the smooth function det: GL(n,R) — R. 


(a) Using matrix entries (A?) as global coordinates on GL(n,R), 
show that the partial derivatives of the determinant map 
det: GL(n,R) — R are given by 


ae det(A) = (det A)(A7!)!. 
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(Hint: Expand det A by minors along the ith column and use 
Cramer’s rule.| 


(b) Conclude that the differential of the determinant function is 
d(det) 4(B) = (det A) tr(A~'B) 


for A € GL(n, R) and B € T,4 GL(n,R) & M(n,R), where tr A = 
>>, Ai is the trace of A. 


D 


Submanifolds 


Many of the most familiar examples of manifolds arise naturally as subsets 
of other manifolds—for example, the n-sphere is a subset of R"*! and the 
n-torus T” = S! x --- x S! is a subset of C x --- x C = C”. In this chapter, 
we will explore conditions under which a subset of a smooth manifold can 
be considered as a smooth manifold in its own right. As you will soon 
discover, the situation is quite a bit more subtle than the analogous theory 
of topological subspaces. 

Because submanifolds are typically presented as images or level sets of 
smooth maps, a good portion of the chapter is devoted to analyzing the 
conditions under which such sets are smooth manifolds. We begin by in- 
troducing three special types of maps whose level sets and images are well 
behaved: submersions, immersions, and embeddings. Then we define the 
most important type of smooth submanifolds, called embedded submani- 
folds. These are modeled locally on linear subspaces of Euclidean space. 

Next, in order to show how submersions, immersions, and embeddings 
can be used to define submanifolds, we will prove an analytic result that 
will prove indispensable in the theory of smooth manifolds: the inverse 
function theorem. This theorem and its corollaries show that, under certain 
hypotheses on the rank of its push-forward, a smooth map behaves locally 
like its push-forward. 

The remainder of the chapter consists of various applications of the in- 
verse function theorem to the study of submanifolds. We show that level 
sets of submersions, level sets of constant-rank smooth maps, and images 
of embeddings are embedded submanifolds. We also observe that the image 
of an injective immersion looks locally like an embedded submanifold, but 
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may not be one globally; this leads to the definition of a more general kind 
of submanifold, called an immersed submanifold. 

At the end of the chapter, we apply the theory of submanifolds to study 
conditions under which an algebraic subgroup of a Lie group is itself a Lie 
group. 


Submersions, Immersions, and Embeddings 


Because the push-forward of a smooth map F at a point p represents the 
“best linear approximation” to F near p, we can learn something about 
F itself by studying linear-algebraic properties of its push-forward at each 
point. The most important such property is its rank (the dimension of its 
image). 

If F: M — N is a smooth map, we define the rank of F at p € M to 
be the rank of the linear map Fi: T,M — Trip) N; it is of course just the 
rank of the matrix of partial derivatives of F' in any coordinate chart, or 
the dimension of Im Fy. C Typ) N. If F has the same rank k at every point, 
we say it has constant rank, and write rank F = k. 

An immersion is a smooth map F’: M — N with the property that F,, 
is injective at each point (or equivalently rank F = dim M). Similarly, a 
submersion is a smooth map F: M — WN such that Fy is surjective at 
each point (equivalently, rank F = dim N). As we will see in this chapter, 
immersions and submersions behave locally like injective and surjective 
linear maps, respectively. 

One special kind of immersion is particularly important. A (smooth) 
embedding is an injective immersion F': M — N that is also a topologi- 
cal embedding, i.e., a homeomorphism onto its image F(M) Cc N in the 
subspace topology. Since this is the primary kind of embedding we will 
be concerned with in this book, the term “embedding” will always mean 
smooth embedding unless otherwise specified. 


Example 5.1 (Submersions, Immersions, and Embeddings). 


(a) If My,..., MM, are smooth manifolds, each of the projections 7;: M1 x 
- x My, — M; is a submersion. In particular, the projection 
a: R"+* —; R” onto the first n coordinates is a submersion. 


(b) Similarly, with M1,..., Mz as above, if p; © M; are arbitrarily chosen 
points, each of the maps 1;: M; — M, x --- x Mx given by 


13 (q) = (pi,-- -)>Pj-159,Pj+1)- . +, Dk) 


is an embedding. In particular, the inclusion map R” @ R"+* given 
by sending (z1,...,2”) to (xz!,...,2",0,...,0) is an embedding. 
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(c) If y: J — M is a smooth curve in a smooth manifold M, then ¥ is 
an immersion if and only if 7/(t) 4 0 for all t € J. 


(d) Any smooth covering map 7: M = M is both an immersion and a 
submersion. 


(e) If E is a smooth vector bundle over a smooth manifold M, the pro- 
jection map 7: E — M is a submersion. 


(f) Let T? = S' x S! denote the torus. The smooth map F': T? — R? 
given by 


F(e'?, e*®) = ((2 + cosy) cos 8, (2 + cosy) sin 0, sin y) 


is a smooth embedding of T? into R® whose image is the doughnut- 
shaped surface obtained by revolving the circle (y—2)? +2? +1 about 
the z-axis. 


Exercise 5.1. Verify the claims in the preceding example. 


To understand more fully what it means to be an embedding, it is useful 
to bear in mind some examples of injective immersions that are not embed- 
dings. The next two examples illustrate two rather different ways in which 
an injective immersion can fail to be an embedding. 


Example 5.2. Consider the map y: (—1/2,37/2) — R? given by 
¥(t) = (sin 2t, cost). 


Its image is a curve that looks like a figure eight in the plane (Figure 
5.1). (It is the locus of points (x,y) where x? = 4y?(1 — y?), as you can 
check.) It is easy to check that it is an injective immersion because y’(t) 
never vanishes; but it is not a topological embedding, because its image is 
compact in the subspace topology while its domain is not. 


Example 5.3. Let T? = S! x S' Cc C? denote the torus, and let ¢ by any 
irrational number. The map y: R — T? given by 


(t) — (ern, ene) 


is an immersion because y(t) never vanishes. It is also injective, because 
4(t1) = 7(t2) implies that both t, — tg and ct, — ctg are integers, which is 
impossible unless t, = to. 

Consider the set y(Z) = {y(n) : n € Z}. If y were a homeomorphism 
onto its image, this set would have no limit point in y(R), because Z has 
no limit point in R. However, we will show that (0) is a limit point of 
4(Z). To prove this claim, we need to show that given any ¢ > 0, there is 
a nonzero integer k such that |y(k) — y(0)| <. 
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—7/2 0 1/2 a 37/2 


FIGURE 5.1. The figure eight curve of Example 5.2. 


Since S' is compact, the infinite set {e?7"°" : n € Z} has a limit point, 
say zo € S!. Given ¢ > 0, we can choose distinct integers n, and nz such 
that |e?"°"7 — z9| < ¢/2, and therefore |e?7°" — e?7en2| < ©, Taking 
k =n, — ng, this implies that 


[erree = 1 pa [e-ane (erent _ ee = iene _ igs | < e: 


and so 


lv(k) — 7(0)| = |, 77") — (1, I) <e. 


In fact, it is not hard to show that the image set 7(R) is actually dense in 
T? (see Problem 5-4). 

As the next lemma shows, one simple criterion that rules out such cases 
is to require that F’ be a closed map (i.e., V closed in M implies F(V) 
closed in NV), for then it follows easily that it is a homeomorphism onto 
its image. Another is that F' be a proper map, which means that for any 
compact set K Cc N, the inverse image F~!(K) is compact. 


Proposition 5.4. Suppose F: M — N is an injective immersion. If any 
one of the following conditions holds, then F is an embedding with closed 
mage. 


(a) F is a closed map. 
(b) F is a proper map. 
(c) M is compact. 


Proof. For set-theoretic reasons, there exists an inverse map F~!: F(M) > 
M, and F is an embedding if and only if F~! is continuous. If F is closed, 
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then for every closed set V C M, (F~!)~1(V) = F(V) is closed in N and 
therefore also in F(M). This implies F'~' is continuous, and proves (a). 
Every proper map between manifolds is closed (see [Lee00, Prop. 4.32]), so 
(a) implies (b). Finally, a simple topological argument (see [Lee00, Lemma 
4.25] shows that every continuous map from a compact space to a Hausdorff 
space is closed, so (a) implies (c) as well. 


Exercise 5.2. Show that a composition of submersions is a submersion, 
a composition of immersions is an immersion, and a composition of embed- 
dings is an embedding. 


Embedded Submanifolds 


Smooth submanifolds are modeled locally on the standard embedding of 
R* into R”, identifying R* with the subspace 


{(a,..., 0°, a°71,...,07):a*t1 =... =o" =0} 


of R”. Somewhat more generally, if U is an open subset of R”, a k-slice of 
U is any subset of the form 


S={(a',...,0°,a°71,...,a") e€ Use a ,... 02" =e} 


for some constants c*+!,...,¢". Clearly any k-slice is homeomorphic to an 
open subset of R*. (Sometimes it is convenient to consider slices defined by 
setting some other subset of the coordinates equal to constants instead of 
the last ones. The meaning should be clear from the context.) 

Let M be a smooth n-manifold, and let (U,y) be a smooth chart on M. 
We say a subset S C U is a k-slice of U if y(S) is a k-slice of y(U). A 
subset N C M is called an embedded submanifold of dimension k (or an 
embedded k-submanifold or a regular submanifold) of M if for each point 
p € N there exists a chart (U,y) for M such that p € U and UN N 
is a k-slice of U. In this situation, we call the chart (U,y) a slice chart 
for N in M, and the corresponding coordinates (x!,...,2”) are called 
slice coordinates. The difference n — k is called the codimension of N in 
M. By convention, we consider an open submanifold to be an embedded 
submanifold of codimension zero. 

The definition of an embedded submanifold is a local one. It is useful to 
express this formally as a lemma. 


Lemma 5.5. Let M be a smooth manifold and N a subset of M. Suppose 
every point p € N has a neighborhood U C M such that UN N is an 
embedded submanifold of U. Then N is an embedded submanifold of M. 


Exercise 5.3. Prove Lemma 5.5. 
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The next proposition explains the reason for the name “embedded sub- 
manifold.” 


Proposition 5.6. Let N Cc M be an embedded k-dimensional submanifold 
of M. With the subspace topology, N is a topological manifold of dimension 
k, and it has a unique smooth structure such that the inclusion map N <= 
M is a smooth embedding. 


Proof. N is automatically Hausdorff and second countable because M is, 
and both properties are inherited by subspaces. To see that it is locally 
Euclidean, we will construct an atlas. For this proof, let 7: R" — R* denote 
the projection onto the first k coordinates. For any slice chart (U, ), let 


V=UNN, 
V =o lV), 
w=noyglv:VoV. 


Then ~ is easily seen to be a homeomorphism, because it has a continuous 
inverse given by py! o jl, where j: R* — R” is the smooth map 


Thus N is a topological k-manifold, and the inclusion map 1: N @ M isa 
topological embedding (i.e., a homeomorphism onto its image). 

To see that N is a smooth manifold, we need to check that the charts 
constructed above are smoothly compatible. Suppose (U, vy) and (U’, y’) are 
two slice charts for N in M, and let (V,w), (V’,w’) be the corresponding 
charts for N. The transition map is given by yoy! =roy’og toj, 
which is a composition of the smooth maps 7, y’ o y~!, and j. Thus the 
atlas we have constructed is in fact a smooth atlas, and defines a smooth 
structure on NV. In any such chart, the inclusion map N — M has the 
coordinate representation 


which is obviously an immersion. Since the inclusion is an injective immer- 
sion and a topological embedding, it is a smooth embedding as claimed. 

The last thing we have to prove is that this is the unique smooth struc- 
ture making the inclusion map a smooth embedding. Suppose that A is a 
(possibly different) smooth structure on N with the property that N — M 
is a smooth embedding. To show that the given smooth structure is the 
same as the one we have constructed, it suffices to show that each of the 
charts we constructed above is compatible with every chart in A. Thus let 
(U, vy) be aslice chart for N in M, let (V, w) be the corresponding chart for 
N constructed above, and let (W,6) be an arbitrary chart in A. We need 
to show that ~o@7!: (0WOAV) = WW NV) is a diffeomorphism. 
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Observe first that wo 67! is a homeomorphism, and is smooth because 

it can be written as the following composition of smooth maps: 
Wav) woveu RR, 

where we think of WMV as an open subset of N (with the smooth structure 
A) and U as an open subset of M. To prove that it is a diffeomorphism, 
we will show that wo 67! is an immersion and appeal to Proposition 5.7 
below, which says that such a map is automatically a diffeomorphism. To 
show it is an immersion, we must show that its push-forward is injective. 
By the argument above, (wo0~'), = 70~40l%0(0~1),. Each of the linear 
maps (x, lx, and (@~+), is injective—in fact y, and (0~'), are bijective— 
and thus their composition is injective. Although 7, is not injective, the 
composition will be injective provided Im(y ov 0 67'), N Ker, = @ (see 
Exercise A.10(b) in the Appendix). Since v takes its values in N, yo.o 7+ 
takes its values in the slice where the coordinates 2*+1,..., x2” are constant: 


porod 'y',...,y”) = (a1 (y),..., 2% (y), Ft, ...,c). 


It follows easily that the push-forward of this map at any point takes 
its values in the span of (e1,...,e,), which has trivial intersection with 
Ker 7, = span(ex41,---;€n)- 


In general, a smooth homeomorphism need not have a smooth inverse. 
A simple counterexample is the map F: R — R given by F(x) = x3, whose 
inverse map is not differentiable at the origin. The problem in this example 
is that the derivative of F vanishes at the origin, which forces the derivative 
of the inverse map to blow up. As the next proposition shows, if a smooth 
homeomorphism is also an immersion, the inverse map will be smooth. 


Proposition 5.7 (Smoothness of Inverse Maps). Suppose M and N 
are smooth manifolds of the same dimension, and F: M — N is a homeo- 
morphism. that is also a smooth immersion. Then F—! is smooth, so F is 
a diffeomorphism. 


Proof. The only thing that needs to be proved is that F~! is smooth, which 
is a local property, so by restricting to coordinate domains and replacing 
F with its coordinate representation, we may as well assume that M and 
N are open subsets of R”. 

The assumption that F' is an immersion means that the total derivative 
DF(a) is injective for each a € M, and therefore is invertible for dimen- 
sional reasons. If F~! were differentiable at 6 € N, the chain rule would 
imply 


Id = D(F 0 F~')(b) 
= DF(F~'(b)) 0 DF7(b), 
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from which it would follow that DF~!(b) = DF(F~1+(b))~'. We will be- 
gin by showing that F~' is differentiable at each point of N, with total 
derivative given by this formula. 

Let b € N and set a = F~1(b) € M. For v,w € R” small enough that 
at+tvéM andb+weEN, define R(v) and S(w) by 


R(v) = Flat v) — F(a) — DF(a)u, 
S(w) = F7\(b+ w) — F7+(b) — DF(a)~1w. 


Because F' is smooth, it is differentiable at a, which means that 
limy+o0 R(v)/|v| = 0. We need to show that lim,,.9 S(w)/|w| = 0. 
For sufficiently small w € R”, define 


v(w) = F-1(b + w) — F-1(b) = F71(b + w) — a. 
It follows that 


Fl (b+w) = F-1(b) + o(w) = a+ v(w), 


w= (b+w) —b= F(a+ v(w)) — F(a), ee) 


and therefore 


= v(w) — DF(a)~'w 
= DF(a)~'(DF(a)v(w) — w) 
= DF(a)-(DF(a)o(w) + F(a) — F(a + (w))) 


We will show below that there are positive constants c and C' such that 
clw| < |u(w)| < Clu] (5.2) 


for all sufficiently small w. In particular, this implies that v(w) 4 0 when 
w is sufficiently small and nonzero. From this together with the result of 
Exercise A.24 in the Appendix, we conclude that 


wl Pa 
1, R@o(w))] [o(w)! 
ea ear aa 

< C|DF(a)"}| oe 


which approaches zero as w — 0 because u(w) — 0 and F is differentiable. 
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To complete the proof that F'—' is differentiable, it remains only to prove 
(5.2). From the definition of R(v) and (5.1), 


v(w) = DF(a)~'DF(a)v(w) 
= DF(a)"'(F(a+ v(w)) — F(a) — R(v(w))) 
= DF(a)~'(w — R(v(w))), 


which implies 
|v(w)| < |DF(a)~"| |w| + |DF(a)~"| |R(v(w))]. 
Because |R(v)|/|v]| + 0 as v — 0, there exists 6; > 0 such that |v] < 6) 
implies |R(v)| < |v|/(2|)DF(a)~+]). By continuity of F~', there exists d2 > 
0 such that |w| < 62 implies |v(w)| < 61, and therefore 
|v(w)| < |DF(a)~*"| wl + (1/2)|v(w)}.- 
Subtracting (1/2)|v(w)| from both sides, we obtain 
|v(w)| < 2|DF(a)~*| |w| 


whenever |w| < 62. This is the second inequality of (5.2). To prove the first, 
we use (5.1) again to get 


w= F(a+ vo(w)) — F(a) = DF(a)u(w) + R(o(w)). 


Therefore, when |w| < 62, 


jul < [D(a lo(w)] + 1RCow))| < (DFC) + see) low) 


This completes the proof that F'—! is differentiable. 

By Exercise A.25, the partial derivatives of F~' are defined at each point 
y € N. Observe that the formula DF~'(y) = DF(F~'(y))~! implies that 
the map DF~!: N — GL(n,R) can be written as the composition 


nw £5 mM 2 GL(n,R) > GL(n,R), (5.3) 


where i(A) = A~!. Matrix inversion is a smooth map, because GL(n, R) is 
a Lie group. Also, DF is a smooth map because its component functions 
are the partial derivatives of F', which are assumed to be smooth. Because 
DF is a composition of continuous maps, it is continuous, and therefore 
the partial derivatives of F~! are continuous, which means that F~! is of 
class C1. 

Now assume by induction that we have shown F~! is of class C’. This 
means that each of the maps in (5.3) is of class C*. Because DF! is 
a composition of C* functions, it is itself C*; this implies that the partial 
derivatives of F~! are of class C*, so F! itself is of class C*+!. Continuing 
by induction, we conclude that F~! is smooth. 
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The Tangent Space to an Embedded Submanifold 


If N is an embedded submanifold of R”, we intuitively think of the tangent 
space T,,N at a point of N as a subspace of the tangent space T;,R”. Sim- 
ilarly, the tangent space to a submanifold of an abstract smooth manifold 
can be viewed as a subspace of the tangent space to the ambient manifold, 
once we make appropriate identifications. 

Let M be a smooth manifold, and let N Cc M be an embedded sub- 
manifold. Since the inclusion map 1: N <— M is an immersion, at each 
point p € N we have an injective linear map .: T,N — T,M. We will 
adopt the convention of identifying T;,N with its image under this map, 
thereby thinking of T;,,N as a certain linear subspace of T,M. Thought of 
as a derivation, a vector X € T,N, identified with 1.X € T,M, acts on 
smooth functions on M in the following way: 


Xf = (AX) f = X(for) =X (fl). 


The next proposition gives a useful way to characterize T,, N as a subspace 
of TM. 


Proposition 5.8. Suppose N C M is an embedded submanifold and p © 
N. As a subspace of Tp,M, the tangent space T,N is given by 


T,N ={X €T,M : Xf =0 whenever f € C~(M) and f|w =O}. 


Proof. First suppose X € T,N C T,M. This means, more precisely, that 
X = 1,Y for some Y € 7T,N. If f is any smooth function on M that 
vanishes on NV, then fot =0, so 


Xf =(u4Y)f =Y(f ov) =0. 


Conversely, if X € TM satisfies X f = 0 whenever f vanishes on N, 
we need to show that there is a vector Y € T,,N such that X = 1,Y. Let 
(z!,...,a”) be slice coordinates for N in some neighborhood U of p, so 
that UN N is the subset of U where x*+! =--.= a” =0, and (a!,..., 2") 
are coordinates for UM N. Because the inclusion map 1: NU — M has 
the coordinate representation 


u(t, ...,@") = (a,...,2",0,...,0) 


in these coordinates, it follows that T,.N (that is, 1.7) is exactly the sub- 
space of T,M spanned by 0/0z1|p,...,0/0x*|,. If we write the coordinate 
representation of X as 


d 


Pp 


ce 2s agi 


we see that X € T,N if and only if X‘ = 0 fori > k. 
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Let y be a bump function supported in U that is equal to 1 in a neighbor- 
hood of p. Choose an index j > k, and consider the function f(x) = y(x)a, 
extended to be zero on M \ U. Then f vanishes identically on N, so 


o= xp => x 2EO) py = x0, 


Thus X € T;,,N as desired. 


Examples of Embedded Submanifolds 


One straightforward way to construct embedded submanifolds is by using 
the graphs of smooth functions. Let U C R” be an open set, and let F: U — 
R* be a smooth function. The graph of F is the subset of R” x R* defined 
by 


T(F) = {(a,y) € R” x R*: 2 €U and y= F(a)}. 
Lemma 5.9 (Graphs as Submanifolds). If U Cc R” is open and 


F: U — R* is smooth, then the graph of F is an embedded n-dimensional 
submanifold of R'+*. 


Proof. Define a map y: U x R* > U x R* by 
o(a,y) = (#,y — F(a). 


It is clearly smooth, and in fact it is a diffeomorphism because its inverse 
can be written explicitly: 


yp '(u,v) = (u,v + F(u)). 


Because y(I'(F)) is the slice {(u,v) : v = 0} of U x R*, this shows that 
I(F) is an embedded submanifold. 


Example 5.10 (Spheres). To show that S” is an embedded submanifold 
of R"*+, we use the preceding lemma together with Lemma 5.5. Let B” be 
the open unit ball in R”, and define functions P+: B” — R by 


F*(u) =+V1-—|ul?. 


For any 7 € {1,...,n}, the intersection of S” with the open set where x’ > 0 
is the graph of the smooth function 


Pe! Cha eee oe Per) 


3 d 


Similarly, the intersection of S” with {x : 2’ < 0} is the graph of F~. Since 
every point in S” is in one of these sets, Lemma 5.5 shows that S” is an 
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embedded submanifold of R"+!. The smooth structure thus induced on S” 
is the same as the one we defined in Chapter 1: In fact, the coordinates 
for S” defined by these slice charts are exactly the graph coordinates we 
defined in Example 1.11. 


Exercise 5.4. Let U = {(x,y,z): 2,y,z > 0} C R®. and let 6: U > R? 
be the spherical coordinate map 


B(x, Yy, z) = (p, Y,; 6) 


=| Jz? +y? +2, cos as nl 
/ 2 + y? + 22 /x? + y? 


Show that (U,&) is a slice chart for S? in R®. 


As Example 5.10 illustrates, showing directly from the definition that a 
subset of a manifold is an embedded submanifold can be somewhat cum- 
bersome. In practice, submanifolds are usually presented to us in one of 
the following two ways: 


e LEVEL SET OF A SMOOTH MAP: Many submanifolds are most natu- 
rally defined as the set of points where some smooth map takes on a 
fixed value, called a level set of the map. For example, the n-sphere 
S” Cc R"*! is defined as the level set f~1(1), where f: R"*! => R is 
the function f(x) = |z|?. 


e IMAGE OF A SMOOTH MAP: In some cases, it is more natural to de- 
scribe a submanifold as the image of a smooth map. For example, 
the map F': R? — R® given by F(0,y~) = ((2 + cosy) cos, (2 + 
cos y) sin 8, sin y) has as its image a doughnut-shaped torus of revo- 
lution. 


Thus two important questions we will need to address are: 
e When is a level set of a smooth map an embedded submanifold? 
e When is the image of a smooth map an embedded submanifold? 


It is easy to construct examples of both cases that are not embedded 
submanifolds. For example, the smooth map F: R — R? defined by 
F(t) = (t?,t?) has as its image a curve that has a “cusp” or “kink” at 
the origin. Similarly, the map ®: R? — R given by ®(a,y) = xy has the 
union of the x and y axes as its zero level set. As Problem 5-5 shows, neither 
of these sets is an embedded submanifold of R?. 

The rest of this chapter is devoted to the study of these two questions, 
and in particular to developing sufficient conditions under which both kinds 
of sets are embedded submanifolds. 
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The Inverse Function Theorem and Its Friends 


The key to answering the questions at the end of the previous section is 
understanding how the local behavior of a smooth map is modeled by the 
behavior of its push-forward. To set the stage, we will consider a linear 
version of the problem: Let S be a k-dimensional linear subspace of R”, 
and let us examine how we might use linear maps to define it. 

First, every subspace S is the kernel of some linear map. (Such a linear 
map is easily constructed by choosing a basis for S and extending it to 
a basis for R".) By the rank-nullity law, if S = Ker LZ, then Im LZ must 
have dimension n — k. Therefore, a natural way to define a k-dimensional 
subspace S C R” is to give a surjective linear map L: R” > R"—* whose 
kernel is S. The vector equation Lx = 0 is equivalent to n — k scalar 
equations, each of which can be thought of as “cutting out” one more 
dimension of S. 

On the other hand, every subspace is also the image of some linear map. A 
choice of basis for S can be used to define an injective linear map E: R* > 
R” whose image is S. Such a map can be thought of as a “parametrization” 
of S. 

In the context of smooth manifolds, the analogue of a surjective linear 
map is a submersion, and the analogue of an injective linear map is an 
immersion. Let M be an n-manifold. By analogy with the linear situation, 
we might expect that a level set of a submersion from M to an (n — k)- 
manifold is an embedded k-dimensional submanifold of M. We will see 
below that this is the case. Analogously, we might expect that the image of 
a smooth embedding from a k-manifold to M is an embedded k-dimensional 
submanifold. This is also the case. 

The basis for all these results is the following analytic theorem. It is 
the simplest of several results we will develop in this section that show 
how the local behavior of a smooth map is modeled by the behavior of its 
push-forward. 


Theorem 5.11 (Inverse Function Theorem). Suppose M and N are 
smooth manifolds and F: M — N is a smooth map. If Fy. is invertible at 
a point p € M, then there exist connected neighborhoods Up of p and Vo of 
F(p) such that F: Up — Vo is a diffeomorphism. 


The proof of this theorem is based on the following elementary result 
about metric spaces. If X is a metric space, a map G: X — X is said to be 
a contraction if there is a constant X < 1 such that d(G(a), G(y)) < Ad(z, y) 
for all x,y € X. Clearly any contraction is continuous. 


Lemma 5.12 (Contraction Lemma). Let X be a complete metric 
space. Every contraction G: X — X has a unique fixed point, t.e., a point 
x €X such that G(x) = «2. 
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Proof. Uniqueness is immediate, for if x and x’ are both fixed points of 
G, the contraction property implies d(x, 2’) = d(G(x),G(2’)) < Ad(a, 2’), 
which is possible only if 2 = 2’. 

To prove the existence of a fixed point, let xo be an arbitrary point in 
X, and define a sequence {x,,} inductively by rn41 = G(a,). For any i > 1 
we have d(a;, 2:41) = d(G(ai_1), G(ai)) < Ad(a;_-1,2;), and therefore by 
induction 


d(x;, Vi41) < d'd(xo, £1). 
Ifj>i>N, 


d(xi,%3) < d(xi, 541) + dls, 42) +--+ + d(xj-1,25) 
< (A +++) + 971 )d(x9, 21) 


< ¥ (359°) deo.) 


\N 
< Toy U0, 21). 
Since this last expression can be made as small as desired by choosing 
N large, the sequence {x,,} is Cauchy and therefore converges to a limit 
x € X. Because G is continuous, 
G(z) =G ( lim Zi) = lim G(a,) = lim @pii = 2, 
n—-co n— co 


TF OS 


so «x is the desired fixed point. 


Proof of the inverse function theorem. ‘The fact that F.: T,M — Tryp) N 
is an isomorphism implies that M and N have the same dimension n. 
Choose coordinate domains U centered at p and V centered at F(p); con- 
sidering the coordinate maps as identifications as usual, we may as well 
assume that U and V are actually open subsets of R” and F(0) = 0. The 
map F = DF(0)~!oF satisfies F,(0) = 0 and DF, (0) = Id. If the theorem 
is true for Fj), then it is true for F = DF(0) o F,. Henceforth, replacing F’ 
by F\, we will assume that F is defined in a neighborhood of 0, F'(0) = 0, 
and DF(0) = Id. 

Let H(z) = x — F(x). Then DH(0) = Id—Id = 0. Because the matrix 
entries of DH(x) are continuous functions of x, there is a number ¢ > 0 
such that |DH(x)| < 1/2 for all « € B.(0). If x, 2’ € B.(0), the Lipschitz 
estimate for smooth functions (Proposition A.28 in the Appendix) implies 


|H(a’) — H(x)| < 5|a’ — a. (5.4) 


Since a’ — ¢ = F(a’) — F(x) + H(2’) — H(z), it follows that 


|x’ —2| <|F(2’) — F(a)| +|H(2’) — H(2)| < |F(2’) — F(a)| + gle’ — a]. 
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Subtracting $|2’ — z| from both sides, we conclude that 


In’ — a] < 21F(2") — F(o)| (5.5) 

for all z,x’ € B,(0). In particular, this shows that F is injective on B-(0). 

Now let y € B-/2(0) be arbitrary. We will show that there exists 2 € 

B.(0) such that F(x) = y. Let G(x) = H(x) +y = x — F(x) + y, so that 
G(a) = « if and only if F(x) = y. If |x| < e, (5.4) implies 


IG(@)| <H(@)1 + lol < Sal + § Se (5.6) 
so G maps B-(0) to itself. It follows from (5.4) that |G(v) — G(2’)| = 
|H(x) — H(2’)| < $|a’ — 2], so G is a contraction. Since B.(0) is compact 
and therefore complete, the contraction lemma implies that G has a unique 
fixed point x € B-(0). From (5.6), |a| = |G(x)| < ©, so in fact x € B.(0), 
thus proving the claim. 

Let U = B.(0)N F~!(Bz/2(0)). Then U is open, and the argument above 
shows that F: U — B./2(0) is bijective, so F~*: Bz/2(0) — U exists. 
Substituting « = F~(y) and 2’ = F~'(y’) into (5.5) shows that F~} is 
continuous. Let Up be the connected component of U containing 0, and 
Vo = F(Uo). Then F: Up — Vo is a homeomorphism, and Proposition 5.7 
shows that it is a diffeomorphism. 


For our purposes, the most important consequence of the inverse function 
theorem is the following, which says that a constant-rank smooth map can 
be placed locally into a particularly simple canonical form by a change of 
coordinates. It is a nonlinear version of the canonical form theorem for 
linear maps (Theorem A.4). 


Theorem 5.13 (Rank Theorem). Suppose M and N are smooth mani- 
folds of dimensions m and n, respectively, and F: M — N is a smooth map 
with constant rank k. For each p € M there exist coordinates (x',..., 2") 
centered at p and (v',...,v") centered at F(p) in which F has the following 
coordinate representation: 


PE ny) Se ee" 050): (5.7) 


Proof. Replacing M and N by coordinate domains U C M near p and 
V CN near F(p) and replacing F' by its coordinate representation, we 
may as well assume that Ff’: U — V where U is an open subset of R™ and 
V is an open subset of R”. The fact that DF(p) has rank k implies that 
its matrix has some k x k minor with nonzero determinant. By reordering 
the coordinates, we may assume that it is the upper left minor, (OF'/0z) 
for i,7 = 1,...,k. Let us relabel the standard coordinates as (2,y) = 
(nef thy aigy™*) in R™ and (pw) = Ges. ow pi") in 
R”. If we write F(z,y) = (Q(a,y), R(x, y)), then our hypothesis is that 
(0Q'/Ozx*) is nonsingular at p. 
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Define y: U — R™ by 


pz, y) = (Q(z, y),y)- 


Its total derivative at p is 


0Q' aq 
Dee S Dai WP) Dya P) | 
0 Im—k 


which is nonsingular because its columns are independent. Therefore, by 
the inverse function theorem, there are connected neighborhoods Uo of p 
and Vo of y(p) such that y: Up — Vo is a diffeomorphism. Writing the 
inverse map as y (x,y) = (A(a,y), B(x, y)) for some smooth functions 
A: Vo — R¥ and B: Vy — R™-*, we compute 


(x,y) = o(y*(2,y)) 


= 9(A(z,y), B(x, y)) (5.8) 
= (Q(A(2, y); Bla, y)), B(x, y)). 


Comparing y components, it follows that B(x,y) = y, and therefore y™! 


has the form 
yo *(a,y) = (A(z,y),y). 


We will take y: Up — R”™ as our coordinate chart near p. Observe that 
poy! =Id implies Q(A(z, y), y) = 2, and therefore Foy! has the form 


Fo y*(z,y) a (x, R(x, y)), 


where we have put R(x, y) = R(A(z,y),y). 
The Jacobian matrix of this map at an arbitrary point (z,y) € (Uo) is 


Ty 0 
D(F 0 y7")(a,y) = aR’ ari 
Oxi Oys 


Since composing with a diffeomorphism does not change the rank of a 
map, this matrix has rank exactly equal to k everywhere in Up. Since the 
first k columns are obviously independent, the rank can be k only if the 
partial derivatives OR*/Oy) vanish identically on Uo, which implies that R 
is actually independent of (y',...,y’~*). Thus in fact 


Fo yp *(,y) a. (x, R(2)). 
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To complete the proof, we need to choose coordinates for R” near F'(p). 
Define ~: Vo — R” by 


b(v,w) = (v,w— R(v)). 


This is a diffeomorphism onto its image, because its inverse is given explic- 
itly by w1(v, w) = (v,w + R(v)); thus w is a coordinate chart on Vo. In 
terms of the coordinate maps y for the domain and w for the range, F' has 
the coordinate representation 


po Foo "(2,y) = ¥(2, R(a)) = (a, Rw) — R(z)) = (2,0), 


which was to be proved. 


The following corollary can be viewed as a somewhat more invariant 
statement of the rank theorem. 


Corollary 5.14. Let F: M — N be a smooth map, and suppose M is 
connected. Then the following are equivalent: 


(a) For each p € M there exist coordinates near p and F(p) in which the 
coordinate representation of F' is linear. 


(b) F has constant rank. 


Proof. First suppose F' has a linear coordinate representation in a neigh- 
borhood of each point. Since any linear map has constant rank, it follows 
that the rank of F' is constant in a neighborhood of each point, and thus 
by connectedness it is constant on all of 17. Conversely, if F’ has constant 
rank, the rank theorem shows that it has the linear coordinate representa- 
tion (5.7) in a neighborhood of each point. 


Another useful consequence of the inverse function theorem is the implicit 
function theorem, which gives conditions under which a level set of a smooth 
map is locally the graph of a smooth function. 


Theorem 5.15 (Implicit Function Theorem). Let U C R” x R* be 
an open set, and let (x,y) = (v!,...,a",y',...,y”) denote the standard 
coordinates on U. Suppose ®: U — R* is a smooth map, (a,b) € U, and 
c= @(a,b). If the k x k matriz 


is nonsingular, then there exist neighborhoods Vo of a and Wo of b and a 
smooth map F': Vo + Wo such that ®-!(c) Vo x Wo is the graph of F, 
i.e., O(a, y) =c for (x,y) € Vo x Wo if and only if y = F(z). 
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Proof. Under the hypotheses of the theorem, consider the smooth map 
WU: U — R” x R* defined by V(z, y) = (x, ®(z, y)). Its total derivative at 
(a, b) is 


Tn 0 
DW (a,b) = i i , 
Fat) ay 
Ox Oy) 


which is nonsingular by hypothesis. Thus by the inverse function theorem 
there exist connected neighborhoods Up of (a,b) and Yo of (a,c) such that 
W: Up — Yo is a diffeomorphism. Shrinking Up and Yo if necessary, we may 
assume Uy = V x W is a product neighborhood. Arguing exactly as in the 
proof of the rank theorem (but with the roles of x and y reversed), the 
inverse map has the form 


Uw! (a, y) = (x, B(z,y)) 


for some smooth map B: Yo — W. 

Now let Vo = {a € V: (a,c) € Yo} and Wy = W, and define F’: Vo — Wo 
by F(x) = B(a,c). Comparing y components in the relation (x,c) = Vo 
W-!(a,c) yields 


c= O(x, B(x, c)) = ®(a, F(x) 
whenever x € Vo, so the graph of F is contained in ®~!(c). Conversely, 
suppose (x,y) € Vo x Wo and ®(2,y) = c. Then U(2,y) = (x, ®(x,y)) = 


(x, €), 80 


(x,y) = o>" (aye) = (x, B(z,c)) = (z, F(x), 


which implies that y = F(x). This completes the proof. 


First Consequences 


When we apply the inverse function theorem and its consequences to maps 
between manifolds, we obtain a wealth of important results. The first one 
is a significant strenghthening of Proposition 5.7. 


Proposition 5.16. Suppose M and N are smooth manifolds of the same 
dimension, and F: M — N is a smooth immersion. Then F is a local 
diffeomorphism. If F is bijective, it is a diffeomorphism. 


Proof. The fact that F’ is an immersion means that F, is bijective at each 
point for dimensional reasons. Then the fact that F’ is a local diffeomor- 
phism is an immediate consequence of the inverse function theorem. If F 
is bijective, then it is a diffeomorphism by Exercise 2.5. 
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The next proposition is a powerful consequence of the rank theorem. (In 
Chapter 6, we will generalize this proposition to characterize surjective and 
bijective maps of constant rank.) 


Proposition 5.17. Let F: M — N be a smooth map of constant rank. If 
F is injective, then it is an immersion. 


Proof. Let m = dim M, n = dim N, and suppose F' has constant rank k. If 
F is not an immersion, then k < m. By the rank theorem, in a neighborhood 
of any point there is a chart in which F' has the coordinate representation 


F(c',...,2™) = (a',...,0*°,0,...,0). (5.9) 


It follows that F'(0,...,0,¢) = F(0,...,0,0) for any sufficiently small ¢, so 
F is not injective. 


Another important application of the rank theorem is to vastly expand 
our understanding of the properties of submersions. As the next few results 
show, surjective submersions play a role in smooth manifold theory closely 
analogous to the role played by quotient maps in topology. 


Proposition 5.18 (Properties of Submersions). Let 7: M — N be 
a submersion. 


(a) m is an open map. 
(b) Every point of M is in the image of a smooth local section of 7. 
(c) If m is surjective, it is a quotient map. 


Proof. Given p € M, let q = a(p) € N. Because a submersion has con- 


stant rank, by the rank theorem we can choose coordinates (a1,...,2) 
centered at p and (y',...,y”) centered at q in which 7 has the coordi- 
nate representation 7(x!,...,a™) = (x!,...,a*). If e is a sufficiently small 


positive number, the coordinate cube 
Coa {et ln! |< es fort 1,454 mt 
is a neighborhood of p whose image under 7 is the cube 


Cha Ty? ly Se tori =1s.kh 


The map a: @ — @, whose coordinate representation is o(21,..., 2") = 
(x!,...,a*,0,...,0) is asmooth local section of 7 satisfying o(q) = p. This 
proves (b). 


Suppose W is any open subset of M and q € 7(W). For any p € W such 
that (p) = q, W contains an open coordinate cube C, centered at p as 
above, and thus 7(W) contains an open coordinate cube centered at 7(p). 
This proves that W is open, so (a) holds. Because a surjective open map is 
automatically a quotient map, (c) follows from (a). 
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The next three propositions provide important tools that we will use 
frequently when studying submersions. The general philosophy is one that 
we will see repeatedly in this book: to “push” a smooth object (such as 
a smooth map) down via a submersion, pull it back via local sections. It 
is convenient to introduce the following terminology: If 7: M — N is any 
surjective map, the fiber of 7 over q € N is the set t~1(q) C M. 


Proposition 5.19. Suppose M, N, and P are smooth manifolds, 7: M — 
N is a surjective submersion, and F: N — P is any map. Then F is smooth 
if and only if F om is smooth: 


N 


F P. 


Proof. If F is smooth, then F' oz is smooth by composition. Conversely, 
suppose that Fo 7 is smooth. For any q € N, let a: U — M be a smooth 
section of 7 defined on a neighborhood U of qg. Then a 0 o = Idy implies 


F\ly = Foldy = (Fon)og, 


which is a composition of smooth maps. This shows that F' is smooth in a 
neighborhood of each point, so it is smooth. 


The next proposition gives a very general sufficient condition under which 
a smooth map can be “pushed down” by a submersion. 


Proposition 5.20 (Passing Smoothly to the Quotient). Suppose 
am: M — N is a surjective submersion. If F: M — P is a smooth map 
that is constant on the fibers of m, then there is a unique smooth map 
F: N— P such that For=F: 


M 


N =P. 
F 


Proof. Clearly, if F exists, it will have to satisfy F (q) = F(p) whenever 
p€m‘(q). We use this to define F: Given q € M, let F(q) = F(p), where 
p € M is any point in the fiber over g. (Such a point exists because we are 
assuming that 7 is surjective.) This is well-defined because F' is constant 
on the fibers of 7, and it satisfies Fonr=F by construction. Thus F is 
smooth by Proposition 5.19. 
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Our third proposition can be interpreted as a uniqueness result for 
smooth manifolds defined as quotients of other smooth manifolds by sub- 
mersions. 


Proposition 5.21. Suppose 7: M — Ny, and m2: M — No are surjective 
submersions that are constant on each other’s fibers (i.e., 71(p) = 71(p’) 
if and only if 72(p) = m2(p’)). Then there exists a unique diffeomorphism 
F: Ny — No such that Fo 7 = 19: 


M 
nf ~ 
Ny F No. 


Exercise 5.5. Prove Proposition 5.21. 


Level Sets 


Using the tools we have developed so far, we can now give some very general 
criteria for level sets to be submanifolds. 


Theorem 5.22 (Constant Rank Level Set Theorem). Let M and 
N be smooth manifolds, and let F: M — N be a smooth map with constant 
rank k. Each level set of F is a closed embedded submanifold of codimension 
kin M. 


Proof. Let c € N be arbitrary, and let S denote the level set F~!(c) C 
M. Clearly S is closed in M by continuity. To show it is an embedded 
submanifold, we need to show that for each p € S there is a slice chart for 
Sin M near p. From the rank theorem, there are coordinate charts (U, y) 
centered at p and (V,~) centered at c = F'(p) in which F has the coordinate 
representation (5.7), and therefore SU is the slice {(a,...,2") EU: 
fae. a SO}. 


Corollary 5.23 (Submersion Theorem). Jf F: M— N is a submer- 
sion, then each level set of F is a closed embedded submanifold whose codi- 
mension is equal to the dimension of N. 


Proof. A submersion has constant rank equal to the dimension of N. 


This result is extremely useful, because many submanifolds are most nat- 
urally presented as level sets of submersions. In fact, it can be strengthened 
considerably, because we need only check the rank condition on the level 
set we are interested in. If F: M — N is a smooth map, a point p © M 
is said to be a regular point of F if Fy: T,M — Ty(p)N is surjective; it is 
a critical point otherwise. (This means, in particular, that every point is 
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critical if dim M < dim N.) A point c € N is said to be a regular value of 
F if every point of the level set F~+(c) is a regular point, and a critical 
value otherwise. In particular, if F~'(c) = 2, c is regular. Finally, a level 
set F'~!(c) is called a regular level set if c is a regular value; in other words, 
a regular level set is a level set consisting entirely of regular points. 


Corollary 5.24 (Regular Level Set Theorem). Every regular level 
set of a smooth map is a closed embedded submanifold whose codimension 
is equal to the dimension of the range. 


Proof. Let F: M — N bea smooth map and let c € N be a regular value 
such that F~'(c) 4 @. The fact that c is a regular value means that F,, has 
rank equal to the dimension of N at every point of F~!(c). To prove the 
corollary, it suffices to show that the set U of points where rank Ff, = dim N 
is open in M, for then we can apply the preceding corollary with M replaced 
by U. 

To see that U is open, let m = dim M, n = dim N, and suppose p € U. 
Choosing coordinates near p and F(p), the assumption that rank F, = n 
at p means that the n x m matrix representing F;, in coordinates has an 
nxn minor whose determinant is nonzero. By continuity, this determinant 
will be nonzero in some neighborhood of p, which means that F’ has rank 
n in this whole neighborhood. 


Exercise 5.6. If f: 4 — R is a smooth real-valued function, show that 
p€M isa regular point of f if and only if df, £0. 


Example 5.25 (Spheres). Using Corollary 5.24, we can give a much 
easier proof that S” is an embedded submanifold of R"+!. The sphere is 
easily seen to be a regular level set of the function f: R"*! — R given by 
f(x) = |z|?, since df = 25>, x* dx’ vanishes only at the origin, and thus it 
is an embedded n-dimensional submanifold of R"*!. 


Example 5.26 (The Orthogonal Group). A real n x n matrix A is 
said to be orthogonal if the linear map A: R” — R” preserves the Euclidean 
inner product: 


(Az) - (Ay) =a y. 


The set O(n) of all orthogonal n x n matrices is clearly a subgroup of 
GL(n, R), called the n-dimensional orthogonal group. 

It is easy to see that a matrix A is orthogonal if and only if it takes the 
standard basis of R” to an orthonormal basis, which is equivalent to the 
columns of A being orthonormal. Since the (i, 7)-entry of the matrix A7A 
is the dot product of the ith and jth columns of A, this condition is also 
equivalent to the requirement that A? A = I,,. We will show that O(n) is 
an embedded submanifold of GL(n, R). 
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Let S(n,R) denote the set of symmetric n x n matrices, which is easily 
seen to be a linear subspace of M(n, R) of dimension n(n + 1)/2 because 
each symmetric matrix is uniquely determined by its values on and above 
the main diagonal. Define ®: GL(n,R) — S(n, R) by 


B(A) = ATA. 


We will show that the identity matrix J, is a regular value of ®, from which 
it follows that O(n) = ®~1(J,,) is an embedded submanifold of codimension 
n(n +1)/2, that is, of dimension n? — n(n + 1)/2 = n(n — 1)/2. 

Let A € O(n) be arbitrary, and let us compute the push-forward 
®,: T4aGL(n,R) — Tg:a)S(n,R). We can identify the tangent spaces 
T4 GL(n,R) and Tg:,) S(n,R) with M(n,R) and S(n,R), respectively, be- 
cause GL(n, R) is an open subset of the vector space M(n,R) and S(n,R) 
is itself a vector space. For any B € M(n,R), the curve y(t) = A+tB 
satisfies y(0) = A and y'(0) = B. We compute 


},B = ,7'(0) =(604)'(0) = Z| (A+ 4B) 


ae (A+tB)7(A+#B) = BTA+ A*B. 


t=0 


If C € S(n,R) is arbitrary, 
,($AC) = $C7ATA+ $A AC = 3C4+5C=C. 


Thus ®, is surjective, which proves the claim. 


Example 5.27 (The Special Linear Group). The special linear group 
SL(n, R) is the set of n x n matrices with determinant equal to 1. Because 
the determinant function satisfies det(AB) = det Adet B, SL(n,R) is a 
subgroup of GL(n, R). We will show that det: GL(n,R) — R is a smooth 
submersion, from which it follows that SL(n, R) = det~*(1) is an embedded 
submanifold. Let A € GL(n,R) be arbitrary. Problem 4-10 shows that 
the differential of det is given by d(det)4(B) = (det A) tr(A~'B) for B € 
T4 GL(n, R) = M(n,R). Choosing B = A yields 


d(det) 4(A) = (det A) tr(A~1 A) = (det A) tr(I,) = ndet A £ 0. 
This shows that d(det) never vanishes on GL(n,R), so det is a submersion 


and SL(n, R) is an embedded submanifold of dimension n? —1 in GL(n, R). 


Not all embedded submanifolds are naturally given as level sets of sub- 
mersions as in these examples. However, the next proposition shows that 
every embedded submanifold is at least locally of this form. 
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Proposition 5.28. Let S be a subset of a smooth n-manifold M. Then 
S is an embedded k-submanifold of M if and only if every point p € S 
has a neighborhood U in M such that UN S is a level set of a submersion 
F:U 3 R**, 


Proof. First suppose S is an embedded k-submanifold. If (x',...,2”) are 
slice coordinates for S on an open set U C M, the map F: U — R"-* given 
in coordinates by F(a) = (a*+!,...,2”) is easily seen to be a submersion 
whose zero level set is SMU. Conversely, suppose that around every point 
p € S there is a neighborhood U and a submersion F: U — R"~* such that 
SQU = F~1(c) for some c € R"~*. By the submersion theorem, SNU is an 
embedded submanifold of U, and so by Lemma 5.5, S' is itself an embedded 
submanifold. 


If S Cc M is an embedded submanifold, a smooth map F': M — N such 
that S is a regular level set of F' is called a defining map for S. In the special 
case N = R"-* (so that F is a real-valued or vector-valued function), it is 
usually called a defining function. Examples 5.25 through 5.27 show that 
f(x) = |2|? is a defining function for the sphere, (A) = A7 A is a defining 
function for O(n), and the determinant is a defining function for SL(n, R). 
More generally, if U is an open subset of M and F': U — N is a smooth 
map such that SU is a regular level set of F', then F is called a local 
defining map (or local defining function if N = R"~*) for S. Proposition 
5.28 says that every embedded submanifold admits a local defining function 
in a neighborhood of each point. 

The next lemma shows that defining maps give a concise characterization 
of the tangent space to an embedded submanifold. 


Lemma 5.29. Suppose S C M is an embedded submanifold. If F: U — N 
is any local defining map for S, then TpS = Ker Fy: TpM — Trp) N for 
each p € U. 


Proof. Recall that we identify T,S with the subspace u.(T,S) C T,M, 
where u: S — M is the inclusion map. Because Foz is constant on SONU, 
it follows that F,, ov, is the zero map from T,S to Tp(p)N, and therefore 
Imv, C Ker F,. On the other hand, F is surjective by the definition of a 
defining map, so the rank-nullity law implies that 


dim Ker F, = dim T,,M — dim Trip) N = dimT,S = dimIm1z,, 


which implies that Imv, = Ker Fy. 


Our next example is a bit more complicated. It will be of use to us in 
the next chapter. 


Example 5.30 (Matrices of fixed rank). As in Chapter 1, let M(m x 
n,R) denote the mn-dimensional vector space of m x n real matrices. For 
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any k, let My(m x n,R) denote the subset of M(m x n,R) consisting of 
matrices of rank k. We showed in Example 1.10 that My(m x n,R) is an 
open submanifold of M(m x n,R) when & = min(m,n). Now we will show 
that when 0 < k < min(m,n), Mz(m x n,R) is an embedded submanifold 
of codimension (m — k)(n — k) in M(m x n,R). 

Let Eo be an arbitrary m x n matrix of rank k. This implies that Eo 
has some k x & minor with nonzero determinant. For the time being, let us 
assume that it is the upper left minor. Writing Eo in block form as 


where Ao isa kx k matrix and Do is of size (m—k) x (n—k), our assumption 
is that Apo is nonsingular. 
Let U be the set 


v=4(¢ of EM(m x n,):det Azo}. 


By continuity of the determinant function, U is an open subset of M(m x 


n,R) containing Ep. Given E = (4%) € U, consider the invertible n x n 


matrix 
A-! —A-!B 
P={ Aas i) 


Since multiplication by an invertible matrix does not change the rank of a 
matrix, the rank of E is the same as that of 


A B\ (A) —A1B iF 0 
a é By or Ape le Gre Bee): 0) 


Clearly EP has rank k if and only if D-— CA~!B is the zero matrix. (To 
understand where P came from, observe that EF has rank k& if and only if 
it can be reduced by elementary column operations to a matrix whose last 
n —k columns are zero. Since elementary column operations correspond to 
right multiplication by invertible matrices, it is natural to look for a matrix 
P satisfying (5.10).) 

Thus we are led to define F: U — M(m—k x n—k,R) by 


A uBYi. 4 
F(é p)=D-CA B. 


Clearly F is smooth. To show that it is a submersion, we need to show 
that DF(EF) is surjective for each E € U. Since M(m—k x n—k,R) is 
a vector space, tangent vectors at F(£) can be naturally identified with 
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(m—k) x (n—k) matrices. Given E = (4 2%) and any matrix X € M(m— 
k x n—k,R), define a curve y: (—e,¢) — U by 


ae) = & bee 


Then 


F,7'(0) = (Fo 7)'(t) (D+tX —CA™'B)=X. 


t=0 


d 
dt 
Thus F' is a submersion and so My(m x n,R)NU is an embedded subman- 
ifold of U. 

Now if £ is an arbitrary matrix of rank k, just note that it can be 
transformed to one in U by a rearrangement of its rows and columns. Such 
a rearrangement is a linear isomorphism R: M(m x n,R) — M(m x n,R) 
that preserves rank, so U’ = R~1(U) is a neighborhood of Ej and F o 
R: U' — M(m—&k x n—k,R) is a submersion whose zero level set is 
M;(m x n,R) NU". Thus every point in M;(m x n,R) has a neighborhood 
U’ in M(mxn,R) such that U'NM;,(m xn, R) is an embedded submanifold 
of U’, so Mg (m x n,R) is an embedded submanifold by Lemma 5.5. 


Images of Embeddings and Immersions 


The next question we want to address is when the image of a smooth 
map is an embedded submanifold. The most important case is that of an 
embedding, for which the answer is provided by the following theorem. 


Theorem 5.31. The image of a smooth embedding is an embedded sub- 
manifold. 


Proof. Let F: N — M be an embedding. We need to show that each point 
of F(N) has a coordinate neighborhood U Cc M in which F(N)NU isa 
slice. 

Let p € N be arbitrary, and let (U,y), (V,w) be coordinate charts cen- 
tered at p and F(p) in which F|y: U — V has the coordinate representation 
(5.7). In particular (shrinking U and V if necessary), this implies that F'(U) 
is a slice in V. Because an embedding is a homeomorphism onto its image 
with the subspace topology, the fact that F(U) is open in F(.N) means 
that there is an open set W C M such that F(U) = W012 FN). Replacing 
V by V = VOW, we obtain a slice chart (V, w|~) containing F(p) such 
that Vn F(N) = VN F(U) is a slice of V. 


The preceding theorem combined with Proposition 5.6 can be summa- 
rized by the following corollary. 


Corollary 5.32. Embedded submanifolds are precisely the images of em- 
beddings. 
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Immersed Submanifolds 


Although embedded submanifolds are the most natural and common sub- 
manifolds and suffice for most purposes, it is sometimes important to con- 
sider a more general notion of submanifold. In particular, when we study Lie 
groups later in this chapter and foliations in chapter 14, we will encounter 
subsets of smooth manifolds that are images of injective immersions, but 
not necessarily of embeddings. To see what kinds of phenomena occur, look 
back again at the two examples we introduced earlier of injective immer- 
sions that are not embeddings. The “figure eight curve” of Example 5.2 
and the dense curve on the torus of Example 5.3 are both injective im- 
mersions but not embeddings. In fact, their image sets are not embedded 
submanifolds (see Problems 5-3 and 5-4). 

So as to have a convenient language for talking about examples like 
these, we make the following definition. Let M be a smooth manifold. An 
immersed submanifold of dimension k (or immersed k-submanifold) of M 
is a subset N C M endowed with a k-manifold topology (not necessarily 
the subspace topology) together with a smooth structure such that the 
inclusion map 4: N — M is a smooth immersion. 

Immersed submanifolds usually arise in the following way. Given an in- 
jective immersion Ff’: N — M, we can give the image set F(N) C Ma 
topology simply by declaring a set U C F(N) to be open if and only if 
F-1(U) C N is open. With this topology, F(N) is clearly a topological 
k-manifold homeomorphic to N, and there is a unique smooth structure on 
it such that F: N — F(N) is a diffeomorphism. (The smooth coordinate 
maps are just the maps of the form yo F~!, where ¢ is a smooth coordinate 
map for N.) With this topology and smooth structure, 1: F(N) — M is 
clearly a smooth immersion, because it is equal to the composition of a 
diffeomorphism followed by an immersion: 


-1 
F(N) => .N +) M. 


Example 5.33 (Immersed Submanifolds). Because the figure eight 
of Example 5.2 and the dense curve of Example 5.3 are images of injec- 
tive immersions, they are immersed submanifolds when given appropriate 
topologies and smooth structures. As smooth manifolds, they are diffeomor- 
phic to R. They are not embedded, because neither one has the subspace 
topology. 


Since the inclusion map of an immersed submanifold is by definition 
an injective immersion, this discussion shows that immersed submanifolds 
are precisely the images of injective immersions. Clearly every embedded 
submanifold is also an immersed submanifold. The converse is not true: 
An immersed submanifold is embedded precisely when it has the subspace 
topology, or equivalently when the immersion is an embedding. 
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The following lemma shows that the local structure of an immersed sub- 
manifold is the same as that of an embedded one. 


Lemma 5.34. Let F: N — M be an immersion. Then F is locally an 
embedding: For any p € N, there exists a neighborhood U of p in N such 
that F|y: U > M is an embedding. 


Exercise 5.7. Prove Lemma 5.34. 


It is important to be clear about what this lemma does and does not say. 
Given an immersed submanifold N C M and a point p € N, it is possible 
to find a neighborhood U of p (in N) such that U is embedded; but it may 
not be possible to find a neighborhood V of p in M such that VMN is 
embedded. 

Because immersed submanifolds are the more general of the two types of 
submanifolds, the term “submanifold” without further qualification means 
an immersed submanifold, which includes an embedded submanifold as a 
special case. If there is room for confusion, it is usually better to specify 
explicitly which type of submanifold is meant, particularly because some 
authors do not follow this convention, but instead reserve the unqualified 
term “submanifold” to mean what we call an embedded submanifold. 

Even though an immersed submanifold N C M is not a topological sub- 
space of MM, its tangent space at any point p € N can nonetheless be viewed 
as a linear subspace of T;,M, as for an embedded submanifold. If 1: N — M 
is the inclusion map, then v is a smooth immersion, so u,: T,N — T,M is 
injective. Just as in the case of embedded submanifolds, we will routinely 
identify T,.N with the subspace 1.7,N C T,M. 


The Case of Manifolds with Boundary 


The definitions of this chapter extend easily to manifolds with boundary. 
First, if M@ and N are manifolds with boundary, a smooth map F’: M — N 
is said to be a immersion if F, is injective at each point, a submersion if 
F, is surjective at each point, and an embedding if it is an immersion and 
a homeomorphism onto its image (with the subspace topology. A subset 
S Cc M is said to be an immersed submanifold of M if S is endowed with 
a smooth manifold structure such that the inclusion map is an immersion, 
and an embedded submanifold if in addition S has the subspace topology. 
(We do not necessarily require the existence of slice coordinates for em- 
bedded submanifolds, because such coordinates can be problematic if S 
contains boundary points of M.) 

More generally, an immersed or embedded submanifold with boundary 
in M is defined in exactly the same way, except that now S itself is allowed 
to have a boundary. 


Exercise 5.8. If M is a smooth manifold with boundary, show that 0M 
is a smooth submanifold of M. 
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Restricting Maps to Submanifolds 


Given a smooth map F': M — N, it is important to know whether F is 
still smooth when its domain or range is restricted to a submanifold. In the 
case of restricting the domain, the answer is easy. 


Proposition 5.35 (Restricting the Domain of a Smooth Map). If 
F: M—N is a smooth map and S C M is an (immersed or embedded) 
submanifold, then F|g: S — N is smooth. 


Proof. The inclusion map vt: S <— M is smooth by definition of an immersed 
submanifold. Since F'|5 = Fo, the result follows. 


When the range is restricted, however, the resulting map may not be 
smooth, as the following example shows. 


Example 5.36. Let N Cc R? be the figure eight submanifold, with the 
topology and smooth structure induced by the immersion y of Example 
5.2. Define a smooth map G: R > R? by 


G(t) = (sin 2t, cost). 


(This is the same formula that we used to define y, but now the domain 
is the whole real line instead of just a subinterval.) It is easy to check that 
the image of G lies in N. However, as a map from R to N, G is not even 
continuous, because y~! o G is not continuous at t = —7/2. 


The next proposition gives sufficient conditions for a map to be smooth 
when its range is restricted to an immersed submanifold. It shows that the 
failure of continuity is the only thing that can go wrong. 


Proposition 5.37 (Restricting the Range of a Smooth Map). Let 
SCN be an immersed submanifold, and let F: M — N be a smooth map 
whose image is contained in S. If F is continuous as a map from M to S, 
then F: M — S' is smooth. 


Proof. Let p € M be arbitrary and let q = F(p) € S. Because the inclusion 
map u: S — N is an immersion, Lemma 5.34 guarantees that there is a 
neighborhood V of g in S$ such that c|y: V < N is an embedding. Thus 
there exists a slice chart (W,w) for V in N centered at qg. (Of course, it 
might not be a slice chart fo S in N.) The fact that (W, 2) is a slice chart 
means that (Vo, w) is a chart for V, where Vo = WV and w = Tow, with 
m7: R" — R* the projection onto the first k = dim S coordinates. Since 
Vo = (e\v)~1(W) is open in V, it is open in N in its given topology, and 
so (Vo, w) is also a chart for N. 

Let U = F~1(Vo) C M, which is an open set containing p. (Here is where 
we use the hypothesis that F' is continuous.) Choose a coordinate chart 
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(Uo, y) for M such that p € Up C U. Then the coordinate representation 


of F: M — S' with respect to the charts (Uo, y) and (Vo, v) is 


WoFogt=no(poFoy}), 


which is smooth because F': M — N is smooth. 


In the special case in which the submanifold S is embedded, the conti- 
nuity hypothesis is always satisfied. 


Corollary 5.38 (Embedded Case). Let S Cc N be an embedded sub- 
manifold. Then any smooth map F: M — N whose image is contained in 
S is also smooth as a map from M to S. 


Proof. When S C N has the subpace topology, a continuous map F’: M — 
N whose image is contained in S is automatically continuous into S. (This 
is the characteristic property of the subspace topology—see [Lee00].) 


Vector Fields and Covector Fields on Submanifolds 


If N Cc M is an immersed or embedded submanifold, a vector field X on 
M does not necessarily restrict to a vector field on NV, because X, may not 
lie in the subspace T,,N C T,M at a point p € N. A vector field X on M 
is said to be tangent to N if X, € T,N C T,M for each p € N. 


Lemma 5.39. Let N C M be an immersed or embedded submanifold, and 
lett: N << M denote the inclusion map. If X is a smooth vector field on 
M that is tangent to N, then there is a unique smooth vector field on N, 
denoted by X|n, that is v-related to X. 


Proof. For a vector field Z € T(N) to be u-related to X means that t,Z) = 
X, for each p € N. Because the injective linear map 14: T,N — T,M is 
just inclusion (under our identification of T,.N with a subspace of T,/M/), 
this means that the unige vector field Z that is u-related to X, if one exists, 
must be given by Z, = Xp for all p € N. So we need only check that this 
defines a smooth vector field on N. 

Let p be any point in N. Since an immersed submanifold is locally em- 
bedded, there is a neighborhood V of p in N that is embedded in M. Let 
(y',...,y™) be slice coordinates for V in a neighborhood U of p in M, such 
that VNU is the set where yt! =--- = y™ = 0. We can write X locally 
on U as 
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At points g € VOU, T,N = T,V is spanned by 0/dy'|q,...,0/Oy”|q, so the 
condition that X be tangent to N means that X"t!(q) =---= X"™(q) =0 
at any point g © VNU. Thus our vector field Z has the local coordinate 
expression 


», @EVNU. 
q 


Since the component functions X° are restrictions of smooth functions on 
U, they are smooth, and thus Z is smooth. 


The restriction of covector fields to submanifolds is much simpler. Sup- 
pose NV C M is an immersed submanifold, and let 1: N — M denote the 
inclusion map. If o is any smooth covector field on M, the pullback by 
t yields a smooth covector field 1*o on N. To see what this means, let 
X, € TZN be arbitrary, and compute 


(U"o)p(Xp) = op (tx Xp) 
= 0,(Xp), 


since t,: T,N — T,M is just the inclusion map, under our usual identifica- 
tion of T,.N with a subspace of 7,,M. Thus u*o is just the restriction of o 
to vectors tangent to N. For this reason we often write o|y in place of v*o, 
and call it the restriction of o to N. Be warned, however, that o|, might 
equal zero at a given point of N, even though considered as a covector 
field on M, o might not vanish there. An example will help to clarify this 
distinction. 


Example 5.40. Let o = dy on R?, and let N be the z-axis, considered as 
a submanifold of R?. As a covector field on R?, o does not vanish at any 
point, because one of its components is always 1. However, the restriction 
o|n is identically zero: 


o|n = Udy = d(you) =0, 


because y vanishes identically on N. 


To distinguish the two ways in which we might interpret the statement 
“o vanishes on N,” we will say that ao vanishes along N or vanishes at 
points of N if o, = 0 for every point p € N. The weaker condition that 
a|n = 0 will be expressed by saying that the restriction of o to N vanishes. 


Exercise 5.9. Suppose M is a smooth manifold and N C M is an im- 
mersed submanifold. If f € C°(M), show that d(f|w) = (df)|n~. Conclude 
that if f is constant on N, then the restriction of df to N is zero. 
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Lie Subgroups 


A Lie subgroup of a Lie group G is a subgroup of G endowed with a topology 
and smooth structure making it into a Lie group and an immersed subman- 
ifold of G. The following proposition shows that embedded subgroups are 
automatically Lie subgroups. 


Proposition 5.41. Let G be a Lie group, and suppose H C G is a sub- 
group that is also an embedded submanifold. Then H is a closed Lie sub- 
group of G. 


Proof. We need only check that multiplication H x H — H and inversion 
H — H are smooth maps. Because multiplication is a smooth map from 
GxG into G, its restriction is clearly smooth from H x H into G (this is true 
even if H is merely immersed). Because H is a subgroup, multiplication 
takes H x H into H, and since H is embedded, this is a smooth map into 
HT by Corollary 5.38. A similar argument applies to inversion. This proves 
that H is a Lie subgroup. 

To prove that H is closed, suppose that {h;} is a sequence of points in 
HAT converging to a point g € G. Let U be the domain of a slice chart for 
HT containing the identity, and let W be a smaller neighborhood of e such 
that W Cc U. Since the map p: G x G = G given by (gi, 92) = 9, ge 
is continuous, there is a neighborhood V of the identity with the property 
that V x V C y~!(W), which means that 9 92 € W whenever gi, g2 € V. 

Because g~ th; — e, by discarding finitely many terms of the sequence 
we may assume that g~'h; € V for all i. This implies that 


h5*he = (g7 thy)" *(g7 ha) € W 
for alli and j. Fixing j and letting 7 — oo, we find ho hi => ho*g EeWwcu. 


Since HU is a slice, it is closed in U, and therefore ho*g € H, which 
implies g € H. Thus 4 is closed. 


We will see in Chapter 15 that this proposition has an important con- 
verse, called the closed subgroup theorem. 


Example 5.42 (Lie Subgroups). 


(a) The subset GLt(n,R) C GL(n, R) consisting of real n x n ma- 
trices with positive determinant is a subgroup because det(AB) = 
(det A)(det B). It is an open subset of GL(n,R) by continuity of the 
determinant function, and therefore it is an embedded Lie subgroup 
of dimension n?. 

(b) The circle group S! is a Lie subgroup of C* because it is a subgroup 
and an embedded submanifold. 


(c) 
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The orthogonal group O(n) (the group of n x n orthogonal matrices) 
is an embedded Lie subgroup of GL(n,R) (see Example 5.26). It 
is a compact group because it is a closed and bounded subset of 
M(n, R): closed because it is a level set of the continuous map ®(A) = 
A’ A, and bounded because each column of an orthogonal matrix 
has norm 1, which implies that the Euclidean norm of A € O(n) is 


(44)? = Vn. 
The special linear group SL(n,R) (the set of n x n real matrices 
of determinant 1) is a subgroup and an embedded submanifold of 


codimension 1 in GL(n,R) by Example 5.27. Therefore it is a Lie 
subgroup. 


The special orthogonal group is defined as SO(n) = O(n)NSL(n, R) Cc 
GL(n,R). Because every matrix A € O(n) satisfies 


1 = det I, = det(A? A) = (det A)(det A?) = (det A)?, 


it follows that det A = +1 for all A € O(n). Therefore, SO(n) is the 
open subgroup of O(n) consisting of matrices of positive determinant, 
and is therefore also an embedded Lie subgroup of dimension n(n — 
1)/2 in GL(n, R). It is a compact group because it is a closed subset 
of O(n). 


Let H c T? be the dense immersed submanifold of the torus that is 
the image of the immersion y: R — T? defined in Example 5.3. It is 
easy to check that y is a group homomorphism and therefore H is 
a subgroup of T?. Because the smooth structure on H is defined so 
that y: R — Z is a diffeomorphism, H is a Lie group (in fact, Lie 
isomorphic to R) and is therefore a Lie subgroup of T?. 
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Problems 
5-1. Let F: R? — R be defined by 
F(z,y)=2? +ayty? +1. 
Which level sets of F are embedded submanifolds of R?? 
5-2. Define a map F': P? > R* by 


(x? — y?, zy, £2, y2) 
2 + y? + 22 


Flz,y,z| = 


Show that F is a smooth embedding. 


5-3. Show that the image of the curve y: (—1/2,37/2) > R? of Example 
5.2 is not an embedded submanifold of R?. 


5-4. Let y: R — T? be the curve of Example 5.3. 


(a) Show that the image set 7(IR) is dense in T?. 
(b) Show that 7(R) is not an embedded submanifold of the torus. 


5-5. Let F: R — R? and ©: R* — R be defined by 
F(t) = (t,°), 
O(x,y) = ry. 
(a) show that neither F(R) nor 6~1(0) is an embedded submanifold 
of R*. 


(b) Can either set be given a topology and smooth structure making 
it into an immersed submanifold of R?? [Hint: Consider tangent 
vectors to the submanifold at the origin.] 


5-6. Show that an embedded submanifold is closed if and only if the in- 
clusion map is proper. 


5-7. Suppose M is a smooth manifold, p € M, and y',... ,y” are smooth 
real-valued functions defined on a neighborhood of p in M. 


(a) If dy,,...,dy}? form a basis for T,M, show that (y’,... ,y”) are 
coordinates for M in some neighborhood of p. 

(b) If dy, ...,dy,, are independent, show that there are functions 
yntt,...,y™ such that (y',...,y™) are coordinates for M in 
some neighborhood of p. 

(c) If dy}, ..., dy, span TM, show that there are indices 71,... , ix 
such that (y",... , y’*) are coordinates for M in some neighbor- 
hood of p. 


5-14. 
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. Let M be a smooth compact manifold. Show that there is no smooth 


submersion F': M — R* for any k > 0. 


. Suppose 7: M — N is asmooth map such that every point of M is in 


the image of a smooth local section of 7. Show that 7 is a submersion. 


. Consider the map F': R* — R? defined by 


F(a,y,8,t) = (a7 +y,2 


Show that (0,1) is a regular value of F’, and that the level set 
F~1(0,1) is diffeomorphic to S?. 


. Let S C R? be the square of side 2 centered at the origin: 


S= {(z,y) : max(|2], |y|) = 1. 


If F: S' — § is any homeomorphism, show that either F or F~! is 
not smooth. 


. Show that every bijective bundle map is a bundle isomorphism. More 


precisely, if £ and E’ are vector bundles over a smooth manifold M, 
and F: E — E’ is a bijective bundle map, show that F! is also a 
bundle map. 


. Let F: M — N be a smooth map of constant rank k, and let 


S = F(M). Show that S can be given a topology and smooth struc- 
ture such that it is an immersed k-dimensional submanifold of N 
and fF’: M — S is smooth. Are the topology and smooth structure 
uniquely determined by these conditions? 


Decide whether each of the following statements is true or false, and 
discuss why. 


(a) If F: M — N is asmooth map, c € N, and F~1(c) is an em- 
bedded submanifold of M whose codimension is equal to the 
dimension of N, then c is a regular value of F’. 

(b) If S C M is a closed embedded submanifold, there is a smooth 
map F: M — P such that S is a regular level set of F. 


. Let M CN be a closed embedded submanifold. 


(a) Suppose f € C®(M). (This means that f is smooth when con- 
sidered as a function on M, not as a function on a closed subset 
of N.) Show that f is the restriction of a smooth function on N. 


(b) If X € J(M), show that there is a smooth vector field Y on N 
such that X = Y|a. 
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5-16. 


5-17. 
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(c) Find counterexamples to both results if the hypothesis that M 
is closed is omitted. 


Let N Cc M be a connected immersed submanifold. Show that a 
function f € C°(M) is constant on N if and only if (df)|n = 0. 


If N Cc M is an embedded submanifold and 7: J — M is a smooth 
curve whose image happens to lie in N, show that y‘(t) is in the 
subspace T,(4)N of Ty(4)M for all t € J. Give a counterexample if N 
is not embedded. 


. Let M be a smooth manifold. Two embedded submanifolds N;, No C 


M are said to be transverse (or to intersect transversely) if for each 
pe NiO No, the tangent spaces TN; and T,,N2 together span TM. 
If N, and N2 are transverse, show that N,N is either empty or an 
embedded submanifold of M. Give a counterexample when Nj, and 
No are not transverse. 


. Let M be a smooth n-manifold with boundary. Recall from Chapter 


1 that a point p € M called a boundary point of M if y(p) € OH” for 
some generalized chart (U,y), and an interior point if p(p) € Int H” 
for some generalized chart. Show that the set of boundary points 
and the set of interior points are disjoint. [Hint: If y(p) € OH” and 
w(p) € Int H”, show that pow * is an open map into R” and derive 
a contradiction. ] 


. Let Mi, M2 be connected smooth manifolds of dimension n. For 


i = 1,2, let (Wi, y;) be a coordinate domain centered at some point 
pi € M; such that y;(W;) = Bo(0) C R™. Define U; = vy; *(Bi(0)) C 
W; and M! = M; \ U;. The connected sum of My and M2, denoted 
by # Mpg, is the quotient space of Mj 11 Mé obtained by identifying 
each g € OU, with Yo" ° yi(q) € U2. Show that M,#Mz is con- 
nected, and has a unique smooth n-manifold structure such that the 
restriction of the quotient map to each M/ is an embedding (where 
M} is thought of as a smooth manifold with boundary). Show that 
there are open subsets My, Mo C Mi#Mp that are diffeomorphic to 
M, ~ {pi} and Mz ~ {po}, respectively, and such that M4, 9 Mo is 
diffeomorphic to B2(0) \ {0}. 
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Embedding and Approximation 
‘Theorems 


The purpose of this chapter is to address two fundamental questions about 
smooth manifolds. The questions may seem unrelated at first, but their 
solutions are closely related. 

The first question is “Which smooth manifolds can be smoothly embed- 
ded in Euclidean spaces?” The answer, as we will see, is that they all can. 
This justifies our habit of visualizing manifolds as subsets of R”. 

The second question is “To what extent can continuous maps between 
manifolds be approximated by smooth ones?” We will give two different 
answers, both of which are useful in different contexts. Stated simply, we 
will show that any continuous map from a smooth manifold into R” can be 
uniformly approximated by a smooth map, and that any continuous map 
from one smooth manifold to another is homotopic to a smooth map. 

The essential strategy for answering both questions is the same: first use 
analysis in R” to construct a “local” solution in a fixed coordinate chart; 
then use partitions of unity to piece together the local solutions into a 
global one. 

Before we begin, we need extend the notion of sets of measure zero to 
manifolds. These are sets that are “small” in the sense that is closely related 
to having zero volume (even though we do not yet have a way to measure 
volume quantitatively on manifolds), and include things like countable sets 
and submanifolds of lower dimension. 
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Sets of Measure Zero in Manifolds 


Recall what it means for a set A C R” to have measure zero (see the 
Appendix): for any 6 > 0, A can be covered by a countable collection of 
open cubes whose total volume is less than 6. The next lemma shows that 
cubes can be replaced by balls in the definition. 


Lemma 6.1. A subset A C R” has measure zero if and only if, for every 
6 >0, A can be covered by a countable collection of open balls whose total 
volume is less than 6. 


Proof. This is based on the easily-verified geometric fact that every open 
cube of volume v is contained in an open ball of volume c,,v, and every 
open ball of volume v is contained in an open cube of volume c’,v, where 
Cn and ci), are constants depending only on n. Thus if A has measure zero, 
there is a countable cover of A by open cubes with total volume less than 
6. Enclosing each cube in a ball whose volume is c,, times that of the cube, 
we obtain an open cover of A by open balls of total volume less than c,6, 
which can be made as small as desired by taking 6 sufficiently small. The 
converse is similar. 


We wish to extend the notion of measure zero in a diffeomorphism- 
invariant fashion to subsets of manifolds. Because a manifold does not 
come with a metric, volumes of cubes or balls do not make sense, so we 
cannot simply use the same definition. However, the key is provided by the 
next lemma, which implies that the condition of having measure zero is 
diffeomorphism-invariant for subsets of R”. 


Lemma 6.2. Suppose A C R” has measure zero and F: A > R” is a 
smooth map. Then F(A) has measure zero. 


Proof. By definition, F has an extension, still called F, to a smooth func- 
tion on a neighborhood W of A in R”. Let B be any closed ball contained 
in W. Since B is compact, there is a constant C such that |DF(zx)| < C for 
all x € B. Using the Lipschitz estimate for smooth functions (Proposition 
A.28), we have 


|F (a2) — F(2’)| < Cla — a’ | (6.1) 


for all xa’ € B. ze) 
Given 5 > 0, we can choose a countable cover {B;} of AM B by open 
balls satisfying 


S~ Vol(B;) < 6. 


Then by (6.1), F(B;) is contained in a ball B; whose radius is no more 
than C' times that of B;. Since the volume of a ball in R” is proportional 
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to the nth power of its radius, we conclude that F(ANM B) is contained in 
the collection of balls {B;}, whose total volume is no greater than 


S © Vol(B;) < C6. 
J 
Since this can be made as small as desired, it follows that F(AM B) has 


measure zero. Since F(A) is the union of countably many such sets, it too 
has measure zero. 


Lemma 6.3. Suppose F: U — R” is a smooth map, where U is an open 
subset of R™ andm <n. Then F(U) has measure zero in R”. 


Proof. Let 7: R” — R™ denote the projection onto the first m coordinates, 
and let U = ma 1(U). The result follows by applying the preceding lmma 
to F = For: U > R®, because F(U) = F(U NR”), which is the image of 
a set of measure zero. 


We say a subset A of a smooth n-manifold M has measure zero if for 
every smooth chart (U, y) for M, the set y(ANU) has measure zero in R”. 
It follows immediately from Lemma A.30(c), that any set of measure zero 
has dense complement, because if M \ A is not dense then A contains an 
open set, which would imply (AN V) would contain an open set for some 
coordinate chart (V, 2). 

The following lemma shows that we need only check this condition for a 
single collection of charts whose domains cover A. 


Lemma 6.4. Suppose A is a subset of a smooth n-manifold M, and for 
some collection {(Ua,a)} of charts whose domains cover A, ~a(AN U~) 
has measure zero in R” for each a. Then A has measure zero in M. 


Proof. Let (V, 7) be an arbitrary coordinate chart. We need to show that 
w~(ANV) has measure zero. Some countable collection of the U,’s covers 
AN V. For each such U,,, we have 


WANV NUS) = (bo vy") ° Ya(ANVNU,). 


Now go(ANV NU) is a subset of ya(ANU.), which has measure zero by 
hypothesis. By Lemma 6.2 applied to ~o,", therefore, (ANV MU.) has 
measure zero. Since (A MV) is the union of countably many such sets, it 
too has measure zero. 


As our first application of sets of measure zero in manifolds, we prove 
the following proposition, which is an analogue of Proposition 5.17. 


Proposition 6.5. Let F: M — N be a smooth map of constant rank. 


(a) If F is surjective, then it is a submersion. 
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(b) If F is bijective, then it is a diffeomorphism. 


Proof. As in the proof of Proposition 5.17, let m = dim M, n = dim N, and 
k = rank F. If F is not a submersion, then k < n. By the rank theorem, 
each point has a coordinate neighborhood in which F' has the coordinate 
representation 


Be x66, 8) Se pn eee. 70): (6.2) 


Since any open cover of a manifold has a countable subcover, we can choose 
countably many charts {(U;, ¢;)} for M and corresponding charts {(V;, q;)} 
for N such that the sets {U;} cover M, F maps U; into V;, and the coordi- 
nate representation of F': U; — V; is as in (6.2). Since F(U;) is contained 
in a k-dimensional slice of V;, it has measure zero in N. Because F(M) is 
equal to the countable union of sets F'(U;) of measure zero, F'(M) itself has 
measure zero in N, which implies that F cannot be surjective. This proves 
(a). 

To prove (b), note that a bijective map of constant rank is a submersion 
by part (a) and an immersion by Proposition 5.17, so M and N have the 
same dimension. Then Proposition 5.16 implies that F is a diffeomorphism. 


The next theorem is the main result of this section. 


Theorem 6.6. Suppose M and N are smooth manifolds with dim M < 
dim N, and F: M — N is a smooth map. Then F(M) has measure zero 
in N. In particular, N \ F(M) is dense in N. 


Proof. Write m = dim M and n = dim N, and let {(Ui, yi)} be a countable 
covering of M by coordinate charts. Given any coordinate chart (V,~) for 
N, we need to show that (F'(M)NV) has measure zero in R”. Observe that 
this set is the countable union of sets of the form wo Foy; '(yi(F-1(V)N 
U;)), each of which has measure zero by Lemma 6.3. 


Corollary 6.7. If M is a smooth manifold and N C M is an immersed 
submanifold of positive codimension, then N has measure zero in M. 


Theorem 6.6 can be considered as a special case of the following deeper 
(and somewhat harder to prove) theorem due to Arthur Sard. 


Theorem 6.8 (Sard’s Theorem). /f F: M — N is any smooth map, 
the set of critical values of F has measure zero in N. 


We will neither use nor prove this theorem in this book. For a proof, see 
[Mil65], [Ste64], or [Bre93]. 
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The Whitney Embedding Theorem 


Our first major task in this chapter is to show that every smooth n-manifold 
can be embedded in R?"+!. We will begin by proving that if m > 2n, any 
smooth map into R” can be perturbed slightly to be an immersion. 


Theorem 6.9. Let F: M — R™ be any smooth map, where M is a smooth 
n-manifold and m > 2n. For_any « > 0, there is a smooth immersion 
F: M —R™ such that sup, |F — F| <e. 


Proof. Let {W;} be any regular open cover of M as defined in Chapter 2 (for 
example, a regular refinement of the trivial cover consisting of M alone). 
Then each W; is the domain of a chart #;: W; — Bs(0), and the precompact 
sets U; = u; '(B1(0)) still cover M. For each k € N, let My = ee U;. We 
interpret Mo to be the empty set. We will modify F inductively on one set 
W, at a time. 

Let {y;} be a partition of unity subordinate to {W;}. Let Fo = F, and 
suppose by induction we have defined smooth maps Fj}: M — R"™ for 
j=l,...,k—1 satisfying 


(i) supyy |Fy — Fl <¢; 
(ii) Fj(x) = Fj-1(x) unless « € W;; 
(iii) (Fj). is injective at each point of Mj. 


For any m x n matrix A, define a new map F'4: M — R™ as follows: On 
M ~\ supp yz, F'4 = Fe-1; and on Wx, F'4 is the map given in coordinates 
by 


Fia(x) = Fy-1(2) + ye(x) Az, 


where A: R” — R”™ is thought of as a linear map. (When computing in 
W;,, we simplify the notation by identifying maps with their coordinate 
representations as usual.) Since both definitions agree on the set Wz \ 
supp yx where they overlap, this defines a smooth map. We will eventually 
set Fi, = F', for a suitable choice of A. 

Because (i) holds for 7 = & — 1, there is a constant ¢9 < € such that 
|Fy_-1(a) — F(a)| < €0 for x in the compact set supp y,. By continuity, 
therefore, there is some 6 > 0 such that |A| < 6 implies 


sup|F'4 — Fe-i] = sup” |ye(a)Az| < € —€, 
M TESUPP Yr 


and therefore 


sup |F'4 — F| < sup |F'a — Fx-1| + sup |Fx-1 — F| < (€ — €0) + €0 =e. 
M M M 
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Let P: Wi, x M(m x n, R) — M(m x n,R) be the matrix-valued function 
P(a, A) = DF a(x). 


By the inductive hypothesis, P(#,A) has rank n when (a, A) is in the 
compact set (supp yrM,_1) x {0}. By choosing 6 even smaller if necessary, 
we may also ensure that rank P(x, A) = n whenever x € supp yx 1 Mp1 
and |A| < 6. 

The last condition we need to ensure is that rank(F'4). =n on U; and 
therefore on M;, = My_1 UUx. Notice that DF'4(x) = DF,_1(x) + A for 
x € Ux because yy = 1 there, and therefore DF'4(x) has rank n in U,, if and 
only if A is not of the form B— DF,_1(x) for any x € Ux and any matrix B 
of rank less than n. To ensure this, let Q: Wy x M(mxn,R) — M(mxn,R) 
be the smooth map 


Q(«, B) = B-— DFy_-1(2). 


We need to show that there is some matrix A with |A| < 6 that is not of 
the form Q(a, B) for any x € U; and any matrix B of rank less than n. 
For each j = 0,... ,n—1, the set Mj(mx n, R) of mx n matrices of rank j 
is an embedded submanifold of M(m x n, R) of codimension (m— j)(n— j) 
by Example 5.30. By Theorem 6.6, therefore, Q(W;, x M;(m x n,R)) has 
measure zero in M(m x n, R) provided the dimension of W, x M;(m x n,R) 
is strictly less than the dimension of M(m x n,R), which is to say 


n+mn—(m—j)(n- Jj) <mn 
or equivalently 


n—(m—j)(n—7) <0. (6.3) 


When j = n—1, n-—(m—J)(n—j) = 2n—m-—1, which is negative because 
we are assuming m > 2n. For j <n—1, n—(m-— j)(n— 7) is increasing in 
j because its derivative with respect to 7 is positive there. Thus (6.3) holds 
whenever 0 < 7 < n—1. This implies that for each 7 = 0,...,n —1, the 
image under @ of W, x M;(m x n,R) has measure zero in M(m x n,R). 
Choosing A such that |A] < 6 and A is not in the union of these image 
sets, and setting Fi, = F'4, we obtain a map satisfying the three conditions 
of the inductive hypothesis for 7 = k. 

Now let F(x) = limz—oo Fx (x). By local finiteness of the cover {W;}, for 
each k there is some N(k) > k such that W, 1 W; = © for all 7 > N(k), 
and then condition (ii) implies that Fyy(z) = Fin(n)41 = ++: = Fi on Wy for 
all i > N(k). Thus the sequence {F;,(x)} is eventually constant for x in a 
neighborhood of any point, and so F: M > R™ is a smooth map. It is an 
immersion because F’ = Fy(x) on Wz, which has rank n by (iii). 
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Corollary 6.10 (Whitney Immersion Theorem). Every smooth n- 
manifold admits an immersion into R2”. 


Proof. Just apply the preceding theorem to any smooth map F: M —> R?”, 
for example a constant map. 


Next we show how to perturb our immersion to be injective. The intu- 
ition behind this theorem is that, due to the rank theorem, the image of 
an immersion looks locally like an n-dimensional affine subspace (after a 
suitable change of coordinates), so if F(M) C R™ has self-intersections, 
they will look locally like the intersection between two n-dimensional affine 
subspaces. If m is at least 2n + 1, such affine subspaces of R™ can be 
translated slightly so as to be disjoint, so we might hope to remove the 
self-intersections by perturbing F a little. The details of the proof are a bit 
more involved, but the idea is the same. 


Theorem 6.11. Let M be a smooth n-manifold, and suppose m > 2n +1 
and F: M — R™ is an immersion. Then for any € > 0 there is an injective 
immersion F: M — R™ such that supyy |F — F| <. 


Proof. Because an immersion is locally an embedding, there is an open 
cover {W;} of M such that the restriction of F to each W; is injective. 
Passing to a refinement, we may assume that it is a regular cover. As in 
the proof of the previous theorem, let w;: W; — B3(0) be the associated 
charts, U; = vv; '(B1(0)), and let {y;} be a partition of unity subordinate 
to {Wj}. Let My = ees Ug. 

As before, we will modify F' inductively to make it injective on succes- 
sively larger sets. Let Fo = F, and suppose by induction we have defined 
smooth maps Fj: M — R™ for 7 = 1,...,k —1 satisfying 


(i) F; is an immersion; 
(ii) supa, |F) — F| < ¢; 
(iii) Fj (x) = F)-1(a) unless x € W;; 
(iv) F; is injective on Mj; 
(v) EF; is injective on W; for each 7. 
Define the next map Fy: M — R” by 
Fy (x) = Fy—-1(x) + pe(x)b, 


where b € R™ is to be determined. 

We wish to choose b such that Fy,(a) ¢ Fi,(y) when «x and y are distinct 
points of M;,. To begin, by an argument analogous to that of Theorem 6.9, 
there exists 6 such that |b| < 6 implies 


sup |F, —F| < sup |F, — Fe-i|+sup |Fe-1— F| <e. 
M SUPP Yr M 
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Choosing 6 smaller if necessary, we may also ensure that (F;,)« is injective 
at each point of the compact set supp yx; since (Fy). = (Fe-1)« is already 
injective on the rest of M, this implies that Fy, is an immersion. 

Next, observe that if F(x) = Fi,(y), then exactly one of the following 
two cases must hold: 


CASE I: yp (x) 4 ye (y) and 


Fy—-1(x) — Fei (y) 


US a ieag 


(6.4) 


CASE II: yx(x) = yx(y) and therefore also Fy_1(x) = Fy_-i(y). 
Define an open subset U C M x M by 


U ={(2,y) : yr(x) A ve(y)}, 
and let R: U — R™ be the smooth map 


Fy_-i(x) — Fe—-1(y) 
pr(x) — pr(y) 


Because dim U = dim(M x M) = 2n < m, Theorem 6.6 implies that R(U) 
has measure zero in R™. Therefore there exists b € R™ with |b] < 6 such 
that (6.4) does not hold for any (x,y) € U. With this b, (i)—(iii) hold with 
j =k. We need to show that (iv) and (v) hold as well. 

If Fy(x) = Fy(y) for some x,y € Mg, case I above cannot hold 
by our choice of b. Therefore we are in case II: yx(z) = yr(y) and 
Fy-1(x) = Fe—i(y). Tf pe(x) = ve(y) = 0, then x,y € Ma, \Ux C Me-1, 
contradicting the fact that F,_1 is injective on M,_, by the inductive 
hypothesis. On the other hand, if y,(a) and yx(y) are nonzero, then 
z,y € suppyr C Ws, which contradicts the fact that F,_1 is injective 
on W;, by (v). Similarly, if Fi.(v) = Fi.(y) for some x,y € Wj, the same 
argument shows that F,_1(x) = Fy_1(y), contradicting (v). 

Now we let F(x) = lim;—.0 F(x). As before, for any k, this sequence 
is constant on W;, for 7 sufficiently large, so defines a smooth function. If 
F(a) = F(y), choose k such that x,y € Mj. For sufficiently large j, F = Fj 
on Mx, so the injectivity of F; on M;, implies that x = y. 


R(z, y) = 


We can now prove the main result of this section. 


Theorem 6.12 (Whitney Embedding Theorem). Every smooth n- 
manifold admits an embedding into R?"*+ as a closed submanifold. 


Proof. Let M be a smooth n-manifold. By Proposition 5.4(b), a proper 
injective immersion is an embedding with closed image. We will begin by 
constructing a smooth proper map Fy: M — R?”"+! and using the previous 
two theorems to perturb it to a proper injective immersion. 
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To construct a proper map, let {V;} be any countable open cover of M 
by precompact open sets, and let {y~;} be a subordinate partition of unity. 
Define f € C°(M) by 


f(p) = > J¢;(P). 


For any positive integer N, if p ¢ ees V;, then y;(p) =0 forl<j<N, 
so 


IfPl=f~)= S> joj) > S> Neji) > NS ¢9;(~) =N. 
j=N+41 j=N+41 j=l 


Therefore f~![—N, N] is contained in the compact set ye V;. This im- 
plies that f is proper and so is the map Fy: M — R?"*! defined by 
Fe SF Oise), 

Now by Theorem 6.9, there is an immersion F,: M — R?"*? satisfying 
sup gy |F. — Fo| < 1. And by Theorem 6.11, there is an injective immersion 
Fp: M — R?"*! satisfying sup, |Fo — Fil < 1. If K c R?"*? is any 
compact set, it is contained in some ball Br(0), and thus if F2(p) € K we 
have 


|Fo(p)| < |Fo(p) — Fily)| + |Fi(p) — Fa(p)| + |Fa(p)| S1+1+8, 


which implies F, '() is a closed subset of Fy '(B2+4(0)), which is compact 
because Fo is proper. Thus F is a proper injective immersion and hence 
an embedding. 


This theorem, first proved by Hassler Whitney in 1936 [Whi36], answered 
a question that had been nagging mathematicians since the notion of an 
abstract manifold was first introduced: Are there abstract smooth mani- 
folds that are not diffeomorphic to embedded submanifolds of Euclidean 
space? Although this version of the theorem will be quite sufficient for our 
purposes, it is interesting to note that eight years later [Whi44b, Whi44al, 
using much more sophisticated techniques of algebraic topology, Whitney 
was able to obtain the following improvements. 


Theorem 6.13 (Strong Whitney Immersion Theorem). Jf n > 1, 


every smooth n-manifold admits an immersion into R?"~}. 


Theorem 6.14 (Strong Whitney Embedding Theorem). Every 
smooth n-manifold admits an embedding into R?”. 
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The Whitney Approximation Theorem 


In this section we prove the two theorems mentioned at the beginning of 
the chapter on approximation of continuous maps by smooth ones. 

We begin with the case of maps into Euclidean spaces. The following 
theorem shows, in particular, that any continuous map from a smooth 
manifold M into R*® can be uniformly approximated by a smooth map. 
In fact, for later use, we will prove something stronger. If 6: M > Risa 
positive continuous function, we say two maps FF’: M — R* are 6-close 
if |F(x) — F(x)| < d(x) for alla e M. 


Theorem 6.15 (Whitney Approximation Theorem). Let M be a 
smooth manifold and let F: M — R® be a continuous map. Given a positive 
continuous function 6: M — R, there exists a smooth map F: M > R* 
that is 6-close to F. If F is smooth on a closed subset A C M, then F can 
be chosen to be equal to F on A. 


Proof. If F is smooth on the closed set A, then by definition there is some 
neighborhood U of A on which F'|4 has a smooth extension; call this ex- 
tension Fo. (If there is no such set, we just take U = A = @.) Let 


Uo ={y € U : |Fo(y) — F(y)| < 5(y)}- 


It is easy to verify that Up is an open set containing A. 
We will show that there is a countable open cover {U;} of M ~\ A and 
points v; € R* such that 


|F(y) — v;| < d(y) for all y € Uj. (6.5) 


To see this, for any x € M ~\ A, let U, be a neighborhood of x contained 
in M \ A and small enough that 
d(y) > $6(x) and 
|F(y) — F(x)| < 34(2) 


for all y € U,. Then if y € Uz, we have 
|F(y) — F(2)| < 99(x) < d(y). 


The collection of all such sets U, as x ranges over points of M\~ A is an open 
cover of M ~\ A. Choose a countable subcover {U;,}%2,. Setting U; = Uz, 
and vu; = F(a;), we have (6.5). 

Let {vo, yi} be a partition of unity subordinate to the cover {Upo, U;} of 
M, and define F: M—R* by 


F(y) = goly)Fo(y) + >, gily)ui- 


i> 
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Then clearly F is smooth, and is equal to F' on A. For any y € M, the fact 
that >7j>0 ¢i = | implies that 


|F(y) — F(y)| = |yo(y) Fo(y) + 2 gily)vi — (cotw + y vil) )F 


< vo(y)Fo(y) — FU) + D/ eiwles — F)| 
< go(y)5(y) + - vily)d(y) 
= 6(y), : 


which shows that F is 6-close to F. 


Next we wish to consider a continuous map f°: N — M between smooth 
manifolds. Using the Whitney embedding theorem, we can consider M as 
an embedded submanifold of some Euclidean space R™, and approximate 
F by asmooth map into R™. However, in general, the image of this smooth 
map will not lie in M. To correct for this, we need to know that there is a 
smooth retraction from some neighborhood of M onto M. For this purpose, 
we introduce a few more definitions. 

Let M Cc R™ be an embedded n-dimensional submanifold. Identifying 
the tangent space T,,M at a point p € M with asubspace of T,R” = R™, we 
define the normal space to M at p to be the subspace N,M C R” consisting 
of all vectors that are orthogonal to T;,M with respect to the Euclidean dot 
product. The normal bundle of M is the subset NM Cc R™ x R™ defined 
by 


NM = [| N,M ={(p,v) €R™ x R™: pe M and v € N,M}. 
pEeM 


Lemma 6.16. For any embedded submanifold M Cc R™, the normal bun- 
dle NM is an embedded m-dimensional submanifold of R™ x R™. 


Proof. Let n = dim M. Given any point p € M, there exist slice coordinates 
(y',...,y™) on a neighborhood U of p in R™ such that UN M is defined 


by y"t} =---=y™ =0. Define ®: U x R™ > R™" x R” by 
a) O 
P(x, v) = Calon : sy (ae) @ . Oy! is Urs Oy” ; 7 


Then NMN(U x R™) = @~1(0) because the vectors 0/Oy"|2,...,0/Oy"|x 
span T,M at each point x € U. Using the usual change of basis formula 
for coordinate derivatives, the dot product v - 0/Oy"|, can be expanded as 


2 = (ei) (ven de) = een 


Oy’ |, = 


Us 
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(We write the summation explicitly in the last term because the positions 
of the indices do not conform to the summation convention, as is usual 
when dealing with the Euclidean dot product.) Thus the Jacobian of ® at 
a point x € U is the m x 2m matrix 


The m rows of this matrix are obviously independent, so ® is a submersion 
and therefore NIN (U x R™) is an embedded submanifold. Since the same 
is true in a neighborhood of each point of NM, the result follows. 


The subset M x {0} Cc NM is clearly diffeomorphic to M. We will 
identify this subset with M, and thus consider M itself as a subset of NM. 
The normal bundle comes with a natural projection map 7: NM — M 
defined by m(a,v) = a; it is clearly smooth because it is the restriction of 
the projection R™ x R™ — R™ onto the first factor. 

Define a map FE: NM — R™ by 


E(z,v) =x“+v. 


This just maps each normal space N,M affinely onto the affine subspace 
through x and orthogonal to T,.M. Clearly E is smooth because it is the 
restriction of the addition map R™ x R™ — R™ to NM. A tubular neigh- 
borhood of M is a neighborhood U of M in R™ that is the diffeomorphic 
image under F of an open subset V C NM of the form 


V ={(a,v) € NM: |v| < d(a)}, (6.6) 


for some positive continuous function 6: M— R. 


Theorem 6.17 (Tubular Neighborhood Theorem). Every embed- 
ded submanifold of R™ has a tubular neighborhood. 


Proof. We begin by showing that F is a diffeomorphism in a neighborhood 
of each point of Mc NM. Because NM and R™ have the same dimension, 
it suffices to show that E, is surjective at each point. If v € TM, there 
is a smooth curve y: (—e,¢) — M such that 7(0) = x and ¥’(0) = v. Let 
¥: (—e,€) ~ NM be the curve ¥(t) = (y(t), 0). Then 


Eww =(E03)'(0)= 4] _ ((t) +0) =». 


t=0 
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On the other hand, if w € N,M, then defining o: (—e,e) — NM by 
a(t) = (a, tw), we obtain 


E,w = (Eoa)'(0) = ms (a+ tw) = w. 
lb | 

Since T;,M and N,M span R’, this shows EF, is surjective. By the inverse 
function theorem, EF is a diffeomorphism on a neighborhood of x in NM, 
which we can take to be of the form V5(x) = {(2’, v’) : |a—a’| < 6, |u’| < 5} 
for some 6 > 0. (This uses the fact that M is embedded and therefore its 
topology is induced by the Euclidean metric.) 
To complete the proof, we need to show that there is an open set V of the 
form (6.6) on which F is a global diffeomorphism. For each point « € M, let 
r(a) be the supremum of all 6 such that E is a diffeomorphism on V5(x), or 
r(a) = 1 if this supremum is greater than 1. Then r: M — R is continuous 
for the following reason. Given x, x’ € M, if |x — 2’| < r(x), then by the 
triangle inequality V5(x’) is contained in V,(,)(a) for 6 = r(x) — |x — 2’, 
which implies that r(x’) > r(a) — |x — 2’|, or r(x) — r(x’) < |a — a’|. The 
same is true trivially if |e — 2’| > r(x). Reversing the roles of x and 2’ 
yields the opposite inequality, which shows that |r(x) — r(a’)| < ja — 2’], 
so r is continuous. 

Now let V = {(z,v) € NM : |v| < $r(z)}. We will show that E is 
injective on V. Suppose that (a, v) and (a’,v’) are points in V such that 
E(a,v) = E(a2’,v’). Assume without loss of generality that r(a’) < r(z). 


Then |v’| < $r(x’) < $r(x), and it follows from «+ v = 2’ +v’ that 


|x —a”| = |v—v'| < |v] +|v"| < sr (2) sr(a’) < r(x). 


This implies that both (x,v) and (x’,v’) are in the set V,(,)(x) on which 
E is injective, so (%,v) = (a’,v’). Setting U = E(V), we conclude that 
E: V — U is a smooth bijection and a local diffeomorphism, hence a 
diffeomorphism by Proposition 5.7. Thus U is a tubular neighborhood of 
M. 


The principal reason we are interested in tubular neighborhoods is be- 
cause of the next proposition. Recall that a retraction of a topological space 
X onto a subspace M C X is a continuous map r: X — M such that r|jz 
is the identity map of M. 


Proposition 6.18. Let M Cc R™ be an embedded submanifold and let U 
be a tubular neighborhood of M. Then there exists a smooth retraction of 
U onto M. 


Proof. By definition, there is an open subset V C NM containing M such 
that E: V — U is a diffeomorphism. Just define r: U > M by r = ro E7}, 
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where 7: NM — M is the natural projection. Clearly r is smooth. For 
x € M, note that E(x,0) = 2, so r(x) = 70 E71(2) = n(x,0) = 2, which 
shows that r is a retraction. 


The next theorem gives a form of smooth approximation for continuous 
maps between manifolds. It will have important applications later when we 
study de Rham cohomology. If F,G: M — N are continuous maps, recall 
that a homotopy from F to G is a continuous map H: M x I — N (where 
I = 0,1] is the unit interval) such that 


H(x,0) = F(a), 


H(x,1) = G(a) 


for alla € M. If H(x,s) = F(x) = G(x) for all s € I and all x in some 
subset A C M, the homotopy is said to be relative to A. If there exists a 
homotopy from F’ to G, we say that F and G are homotopic (or homotopic 
relative to A if appropriate). 


Theorem 6.19 (Whitney Approximation on Manifolds). Let N 
and M be smooth manifolds, and let F: N — M be a continuous map. 
Then F' is homotopic to a smooth map F: N — M. If F is smooth on a 
closed subset A C N, then the homotopy can be taken to be relative to A. 


Proof. By the Whitney embedding theorem, we may as well assume that 
M is an embedded submanifold of R™. Let U be a tubular neighborhood 
of M in R™, and let r: U — M be the smooth retraction given by Lemma 
6.18. For any x € M, let 


d(a) = sup{e <1: B.(a) Cc U}. 


By a triangle-inequality argument entirely analogous to the one in the proof 
of the tubular neighborhood theorem, 6: M — R is continuous. 

Let 6 = 60 F: N —R. By the Whitney approximation theorem, there 
exists a smooth map fF’: N — R™ that is a 6-approximation to F’, and 
is equal to F on A (which might be the empty set). Define a homotopy 
H:NxI—M by 


H(p,t) = r((1 — t)F(p) + tF(p)). 


This is well defined, because our condition on F guarantees that for each 
p, |F(p) — F(p)| < 6(p) = 6(F(p)), which means that F'(p) is contained in 
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the ball of radius 6(F(p)) around F(p); since this ball is contained in U, so 
is the entire line segment from F'(p) to F(p). 
Thus H is a homotopy between Ho(p) = H(p,0) and Hi(p) = H(p,1). It 


satisfies H(p,t) = F(p) for all p € A, since F = F there. Clearly Hp = F 
from the definition, and Hi(p) = r(F(p)) is smooth. 


If M and N are smooth manifolds, two smooth maps F,G: M — N are 
said to be smoothly homotopic if there is a smooth map H: M x I — N 
that is a homotopy between F and G. 


Proposition 6.20. If F,G: M — N are homotopic smooth maps, then 
they are smoothly homotopic. If F is homotopic to G relative to some closed 
subset A C M, then they are smoothly homotopic relative to A. 


Proof. Let H: M x I > M be a homotopy from F to G (relative to A, 
which may be empty). We wish to show that H can be replaced by a smooth 
homotopy. 

Because Theorem 6.19 does not apply directly to manifolds with bound- 
ary, we first need to extend H to a manifold without boundary containing 
M x I. Let J = (—e,1+ 6) for some ¢ > 0, and define H: M x J > M by 


H(a,t) te [0,1] 
H(z,t) =< H(z,0) t<0 
H(x,1) t>1. 


This is continuous by the gluing lemma [{Lee00, Lemma 3.8]. Moreover, the 
restriction of H to M x {0}UM x {1} is smooth, because it is equal to 
Fom, on M x {0} and Gom, on M x {1} (where 71: M x I > M is the 
projection on the first factor). If F ~ G relative to A, H is also smooth 
on A x J. Therefore, Theorem 6.19 implies that there is a smooth map 
H: M x J — N (homotopic to H, but we do not need that here) whose 
restriction to M x {0} U M x {1}U A x I equals H (and therefore H). 
Restricting back to M x I again, we see that H lax is asmooth homotopy 
(relative to A) between F' and G. 
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Problems 


6-1. 


6-2. 


6-3. 


6-4. 


6-5. 


Show that any two points in a connected smooth manifold can be 
joined by a smooth curve segment. 


Let M Cc R™ be an embedded submanifold, let U be a tubular neigh- 
borhood of M, and let r: U — M be the retraction defined in Propo- 
sition 6.18. Show that U can be chosen small enough that for each 
x € U, r(x) is the point in M closest to x. [Hint: First show that 
each point x € U has a closest point y € M, and this point satisfies 
(x—y) 1 T,M)] 


If M Cc R™ is an embedded submanifold and ¢« > 0, let Mz be the set 
of points in R™ whose distance from M is less than «. If M is com- 
pact, show that for sufficiently small ¢, 0M, is a compact embedded 
submanifold of R™, and M, is a smooth manifold with boundary. 


Let M Cc R™ be an embedded submanifold of dimension n. For each 
p € M, show that there exist a neighborhood U of pin M and smooth 
maps X1,...,Xm—n: U > R” such that (X1i(q),...,Xm—n(q)) form 
an orthonormal basis for NgM at each point gq € U. [Hint: Let (y’) 
be slice coordinates and apply the Gram-Schmidt algorithm to the 
vectors 0/Oy’.] 


Let M@ Cc R™ be an embedded submanifold, and let NM be its normal 
bundle. Show that NM is a vector bundle with projection 7: NM — 
M. |Hint: Use Problem 6-4.] 


4 
Lie Group Actions 


In this chapter, we continue our study of Lie groups. Because their most 
important applications involve actions by Lie groups on other manifolds, 
this chapter concentrates on properties of Lie group actions. 

We begin by defining Lie group actions on manifolds and explaining 
some of their main properties. The main result of the chapter is a theorem 
describing conditions under which the quotient of a smooth manifold by 
a group action is again a smooth manifold. At the end of the chapter, 
we explore two classes of such actions in more detail: actions by discrete 
groups, which are closely connected with covering spaces, and transitive 
actions, which give rise to homogeneous spaces. 


Group Actions on Manifolds 


The importance of Lie groups stems primarily from their actions on mani- 
folds. Let G be a Lie group and M a smooth manifold. A left action of G 
on M isamap G x M — M, often written as (g,p) > g- p, that satisfies 


91 * (92° P) = (9192)  P, (7.1) 


€-p=D. 
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A right action is defined analogously as a map M x G — M with compo- 
sition working in the reverse order: 


(p+ 91) +92 =P+ (9192), 


pre=D. 


A manifold M endowed with a specific G-action is called a (left or right) 
G-space. 

Sometimes it is useful to give a name to an action, suchas 6: Gx M — M, 
with the action of a group element g on a point p usually written 0,(p). In 
terms of this notation, the conditions (7.1) for a left action read 


Io, Ag = 91.925 (7 2) 
be = Idu, 


while for a right action the first equation is replaced by 


95, 2 9g = 9 gog1- 


For left actions, we will generally use the notations g - p and 0,(p) inter- 
changeably. The latter notation contains a bit more information, and is use- 
ful when it is important to specify the specific action under consideration, 
while the former is often more convenient when the action is understood. 
For right actions, the notation p+ g is generally preferred because of the 
way composition works. 

A right action can always be converted to a left action by the trick of 
defining g- p to be p~ g~!; thus any results about left actions can be 
translated into results about right actions, and vice versa. We will usually 
focus our attention on left actions, because their group law (7.2) has the 
property that multiplication of group elements corresponds to composition 
of functions. However, there are some circumstances in which right actions 
arise naturally; we will see several such actions later in this chapter. 

Let us introduce some basic terminology regarding Lie group actions. Let 
0:Gx M—M be aleft action of a Lie group G on a smooth manifold M. 
(The definitions for right actions are analogous.) 


e The action is said to be smooth if it is smooth as a map from G x M 
into M, that is, if @,(p) depends smoothly on (g, p). If this is the case, 
then for each g € G, the map 6,: M — M is a diffeomorphism, with 
inverse 04-1. 


e For any p € M, the orbit of p under the action is the set 
G-p={g-p:g€ G}, 


the set of all images of p under elements of G. 
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e The action is transitive if for any two points p,q € M, there is a group 
element g such that g- p= q, or equivalently if the orbit of any point 
is all of M. 


e Given p € M, the isotropy group of p, denoted by Gp, is the set of 
elements g € G that fix p: 


Gp={g€G:g-p=p}. 


e The action is said to be free if the only element of G that fixes any 
element of M is the identity: g -p = p for some p € M implies g = e. 
This is equivalent to the requirement that G, = {e} for every p € M. 


e The action is said to be proper if the map G x M — M x M given by 
(g,p) > (g+p,p) is a proper map (i.e., the preimage of any compact 
set is compact). (Note that this is not the same as requiring that the 
map G x M — M defining the action be a proper map.) 


It is not always obvious how to tell whether a given action is proper. The 
following alternative characterization of proper actions is often useful. 


Lemma 7.1. Suppose a Lie group G acts smoothly on a smooth manifold 
M. The action is proper if and only if for every compact subset K C M, 
the set Gr ={g EG: (g- K)NK 4 @} is compact. 


Proof. Let 0: Gx M — M x M denote the map O(g,p) = (g - p,p). 
Suppose first that O is proper. Then for any compact set K C M, it is easy 
to check that 


Gr ={g € G: there exists p € K such that g-p€ K} 
= {g © G: there exists p € M such that O(g,p) € K x kK} 
=1G(07'(K x K)), 
where 7g: Gx M — Gis the projection. Thus Gx is compact. Conversely, 
suppose Gx is compact for every compact set K C M. If LC M x M is 


compact, let K = 7 (L) Um2(L) C M, where 7,72: M x M — M are the 
projections on the first and second factors, respectively. Then 


@7'(L) CO"'(K x K) C{(g,p):g- pe K andpe K}CGrx K. 


Since O~!(L) is closed by continuity, it is a closed subset of the compact 
set Gx x K and is therefore compact. 


One special case in which this condition is automatic is when the group 
is compact. 


Corollary 7.2. Any smooth action by a compact Lie group on a smooth 
manifold is proper. 
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Proof. Let G be a compact Lie group acting smoothly on M. For any 
compact set K C M, the set Gx is closed in G by continuity, and therefore 


is compact. 


Example 7.3 (Lie group actions). 


(a) 


The natural action of GL(n,R) on R” is the left action given by 
matrix multiplication: (A,x) +> Az, considering « € R” as a column 
matrix. This is an action because matrix multiplication is associative: 
(AB)«x = A(Bz). It is smooth because the components of Ax depend 
polynomially on the matrix entries of A and the components of 2. 
Because any nonzero vector can be taken to any other by a linear 
transformation, there are exactly two orbits: {0} and R” \ {0}. 


The restriction of the natural action to O(n) x R” — R” defines a 
smooth left action of O(n) on R”. In this case, the orbits are the 
origin and the spheres centered at the origin. To see why, note that 
any orthogonal linear transformation preserves norms, so O(n) takes 
the sphere of radius R to itself; on the other hand, any vector of 
length R can be taken to any other by an orthogonal matrix. (If v 
and v’ are such vectors, complete v/|v| and v’/|v’| to orthonormal 
bases and let A and A’ be the orthogonal matrices whose columns 
are these orthonormal bases; then it is easy to check that A’A~+ takes 
v to v’.) 


Further restricting the natural action to O(n) x S"-1 — S"~!, we 
obtain a transitive action of O(n) on S”~!. It is smooth by Corollary 
5.38, because S"—! is an embedded submanifold of R”. 


The natural action of O(n) restricts to an action of SO(n) on S71. 
When n = 1, this action is trivial because SO(1) is the trivial group 
consisting of the matrix (1) alone. But when n > 1, SO(n) acts tran- 
sitively on S”~!. To see this, it suffices to show that for any v € S”, 
there is a matrix A € SO(n) taking the first standard basis vector e1 
to v. Since O(n) acts transitively, there is a matrix A € O(n) taking 
e, to v. Either det A = 1, in which case A € SO(n), or det A = —1, 
in which case the matrix obtained by multiplying the last column of 
A by —1 is in SO(n) and still takes e; to v. 


Any representation of a Lie group G on a finite-dimensional vector 
space V is a smooth action of G on V. 


Any Lie group G acts smoothly, freely, and transitively on itself by 
left or right translation. More generally, if H is a Lie subgroup of G, 
then the restriction of the multiplication map to H x G — G defines 
a smooth, free (but generally not transitive) left action of H on G; 
similarly, restriction to G x H — G defines a free right action of H 
on G. 
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(g) An action of a discrete group [ on a manifold M is smooth if and 
only if for each g € I, the map pr g- pis a smooth map from M 
to itself. Thus, for example, Z” acts smoothly on the left on R” by 
translation: 


Equivariant Maps 


Suppose M and N are both (left or right) G-spaces. A smooth map 
F: M — N is said to be equivariant with respect to the given G-actions if 
for each g € G, 


F(g-p) = 9- F(p) (for left actions), 
F(p-g) = F(p) +g (for right actions). 


Equivalently, if @ and y are the given actions on M and N, respectively, F 
is equivariant if the following diagram commutes for each g € G: 


M F 


N 
0, Yo 


M 


EF N. 


This condition is also expressed by saying that F' intertwines the two G- 
actions. 


Example 7.4. Let G and H be Lie groups, and let F: G — H bea 
Lie homomorphism. There is a natural left action of G on itself by left 
translation. Define a left action 0 of G on H by 


To check that this is an action, we just observe that 0.(h) = F(e)h = h, 
and 


Og, 0 Oq2(h) = F(gi)(F(g2)h) = (F (91) F(g2))h = F(giga)h = Oq192(h) 


because F’ is a homomorphism. With respect to these G-actions, F is 
equivariant because 


69° F(g') = F(g)F(g’) = F(gg') = Fo L(g’). 
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The following theorem is an extremely useful tool for proving that certain 
sets are embedded submanifolds. 


Theorem 7.5 (Equivariant Rank Theorem). Let M and N_ be 
smooth manifolds and let G be a Lie group. Suppose F: M — N is a smooth 
map that is equivariant with respect to a transitive smooth G-action on M 
and any smooth G-action on N. Then F has constant rank. In particular, 
its level sets are closed embedded submanifolds of M. 


Proof. Let @ and y denote the G-actions on M and N, respectively, and let 
po be any point in M. For any other point p € M, choose g € G such that 
6,(po) = p. (Such a g exists because we are assuming G acts transitively 
on M.) Because y, 0 F = Fo 64, the following diagram commutes: 


Fi. 
Ty) M+ Trp.) N 


a, | |. 


TpM a Trp) N. 
Because the vertical linear maps in this diagram are isomorphisms, the 
horizontal ones have the same rank. In other words, the rank of F, at an 
arbitrary point p is the same as its rank at po, so F’ has constant rank. 


Here are some applications of the equivariant rank theorem. 


Proposition 7.6. Let F: G — H be a Lie group homomorphism. The 
kernel of F is an embedded Lie subgroup of G, whose codimension is equal 
to the rank of F. 


Proof. As in Example 7.4, F' is equivariant with respect to suitable G- 
actions on G and H. Since the action on G by left translation is transitive, 
it follows that F has constant rank, so its kernel F~1(0) is an embedded 
submanifold. It is thus a Lie subgroup by Proposition 5.41. 


As another application, we describe some important Lie subgroups of 
GL(n, C). For any complex matrix A, let A* denote the adjoint or conjugate 
transpose of A: A* = A’. Observe that (AB)* = (AB)? = BT A? = B* A*. 
Consider the following subgroups of GL(n, C): 


e THE COMPLEX SPECIAL LINEAR GROUP: 


SL(n, C) = {A € GL(n, C) : det A = 1}. 


e THE UNITARY GROUP: 


U(n) = {A € GL(n,C) : A*A= Ip}. 
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e THE SPECIAL UNITARY GROUP: 
SU(n) = U(n) NSL(n, C). 


Exercise 7.1. Show that SL(n,C), U(n), and SU(n) are subgroups of 
GL(n, C) (in the algebraic sense). 


Exercise 7.2. Show that a matrix is in U(n) if and only if its columns 
form an orthonormal basis for C” with respect to the Hermitian dot product 
Z-w=d5, zu. 


Proposition 7.7. The unitary group U(n) is an embedded n?-dimensional 
Lie subgroup of GL(n,C). 
Proof. Clearly U(n) is a level set of the map ®: GL(n,C) — M(n,C) 
defined by 

@(A) = A*A. 


To show that ® has constant rank and therefore that U(n) is an embedded 
Lie subgroup, we will show that ® is equivariant with respect to suitable 
right actions of GL(n, C). Let GL(n, C) act on itself by right multiplication, 
and define a right action of GL(n,C) on M(n,C) by 


X-B=B*XB for X €M(n,C), BE GL(n,C). 


It is easy to check that this is a smooth action, and ® is equivariant because 
®(AB) = (AB)*(AB) = B* A* AB = B*O(A)B = O(A) - B. 


Thus U(n) is an embedded Lie subgroup of GL(n, C). 

To determine its dimension, we need to compute the rank of &. Because 
the rank is constant, it suffices to compute it at the identity I, € GL(n, C). 
Thus for any B € T;, GL(n,C) = M(n,C), let 7: (—e,¢) — GL(n,C) be 
the curve y(t) = I, +tB, and compute 


6,B= — 


The image of this linear map is the set of all Hermitian n x n matrices, 
ie., the set of A € M(n,C) satisfying A = A*. This is a (real) vector space 
of dimension n?, as you can check. Therefore U(n) is an embedded Lie 


subgroup of dimension 2n? — n? = n?. 
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Proposition 7.8. The complex special linear group SL(n, C) is an embed- 
ded (2n? — 2)-dimensional Lie subgroup of GL(n,C). 


Proof. Just note that SL(n,C) is the kernel of the Lie group homomor- 
phism det: GL(n,C) — C*. It is easy to check that the determinant is 
surjective onto C*, so it is a submersion by Proposition 6.5(a). Therefore 
SL(n,C) = Ker(det) is an embedded Lie subgroup whose codimension is 
equal to dim C* = 2. 


Proposition 7.9. The special unitary group SU(n) is an embedded (n? — 
1)-dimensional Lie subgroup of GL(n, C). 


Proof. We will show that SU(n) is an embedded submanifold of U(n). Since 
the composition of embeddings SU(n) — U(n) — GL(n,C) is again an 
embedding, SU(n) is also embedded in GL(n, C). 

If A € U(n), then 


1 = det I, = det(A* A) = (det A)(det A*) = (det A)(det A) = | det A|?. 


Thus det: U(n) — C* actually takes its values in S!. It is easy to check 
that it is surjective onto S, so it is a submersion by Proposition 6.5(a). 
Therefore its kernel SU(n) is an embedded Lie subgroup of codimension 1 
in U(n). 


Exercise 7.3. Use the techniques developed in this section to give simpler 
proofs that O(n) and SL(n, R) are Lie subgroups of GL(n, R). 


Quotients of Manifolds by Group Actions 


Suppose a Lie group G acts on a manifold M (on the left, say). The set 
of orbits of G in M is denoted by M/G; with the quotient topology, it is 
called the orbit space of the action. Equivalently, M/G is the quotient space 
of M determined by the equivalence relation p; ~ p2 if and only if there 
exists g € G such that g - p; = pg. It is of great importance to determine 
conditions under which an orbit space is a smooth manifold. 

One simple but important example to keep in mind is the action of R* 
on R* x R” by translation in the R* factor: 6,(x,y) = (v+2,y). The orbits 
are the affine subspaces parallel to R’, and the orbit space (R* x R”)/R* 
is diffeomorphic to R”. The quotient map 7: R* x R” — (R* x R”)/R* is 
a smooth submersion. 

It is worth noting that some authors use distinctive notations such as 
M/G and G\M to distinguish between orbit spaces determined by left 
actions and right actions. We will rely on the context, not the notation, to 
distinguish between the two cases. 

The following theorem gives a very general sufficient condition for the 
quotient of a smooth manifold by a group action to be a smooth manifold. 
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It is one of the most important applications of the inverse function theorem 
that we will see. 


Theorem 7.10 (Quotient Manifold Theorem). Suppose a Lie group 
G acts smoothly, freely, and properly on a smooth manifold M. Then 
the orbit space M/G is a topological manifold of dimension equal to 
dim M — dim G, and has a unique smooth structure with the property that 
the quotient map 7: M — M/G is a smooth submersion. 


Proof. First we prove the uniqueness of the smooth structure. Suppose 
M/G has two different smooth structures such that 7: M — M/G isa 
smooth submersion. Let (M//G), and (M/G)2 denote M/G with the first 
and second smooth structures, respectively. By Proposition 5.19, the iden- 
tity map is smooth from (M/G), to (M/G)2: 


M 


(M/@), => (M/Q)p. 
The same argument shows that it is also smooth in the opposite direction, 
so the two smooth structures are identical. 

Next we prove that M/G is a topological manifold. Assume for definite- 
ness that G acts on the left, and let 06: G x M — M denote the action and 
0: Gx M — M x M the proper map O(g,p) = (g- p,p). For any open 
set U C M, x~'(x(U)) is equal to the union of all sets of the form 6,(U) 
as g ranges over G. Since 6, is a diffeomorphism, each such set is open, 
and therefore 7~!(7(U)) is open in M. Because 7 is a quotient map, this 
implies that 7(U) is open in M/G, and therefore 7 is an open map. 

If {U;} is a countable basis for the topology of M, then {7(U;)} is a 
countable collection of open subsets of M/G, and it is easy to check that 
it is a basis for the topology of M/G. Thus M/G is second countable. 

To show that //G is Hausdorff, define the orbit relation 0 C Mx M by 


O=O0(Gx M)={(g-p,p)E Mx M: pe M, gEG}. 


(It is called the orbit relation because (q,p) € O if and only if p and q are 
in the same G-orbit.) Since proper maps are closed, it follows that © is a 
closed subset of M x M. If a(p) and (gq) are distinct points in M/G, then p 
and q lie in distinct orbits, so (p,q) ¢ 0. If Ux V is a product neighborhood 
of (p,q) in M x M that is disjoint from 0, then 7(U) and (V) are disjoint 
open subsets of M/G containing m(p) and 7(q), respectively. Thus M//G is 
Hausdorff. 

Before proving that M/G is locally Euclidean, we will show that the 
G-orbits are embedded submanifolds of M diffeomorphic to G. For any 
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p € M, define a smooth map 0): G — M by 0)(g) = gp. Note that 
the image of 6") is exactly the orbit of p. We will show that 0) is an 
embedding. First, if @)(g’) = @)(g), then g’ - p = g- p, which implies 
(g-+g') - p = p. Since we are assuming G acts freely on M, this can only 
happen if g~!g’ = e, which means g = g’; thus 6) is injective. Observe 
that 


6) (9'g) = (g'9) -p=9' + (g- vp) =g' - 0g), 


so 0) is equivariant with respect to left translation on G and the given 
action on M. Since G acts transitively on itself, this implies that @@) has 
constant rank. Since it is also injective, it is an immersion by Proposition 
5.17. 

If K C M isa compact set, then (6?))~1(K) is closed in G by continuity, 
and since it is contained inGn ={g€G:(g-K)AK £ @}, it is compact 
by Lemma 7.1. Therefore, 0) is a proper map. We have shown that 0) is 
a proper injective immersion, so it is an embedding by Proposition 5.4(b). 

Let k = dim G and n = dim M—dim G. Let us say that a coordinate chart 
(U,y) on M, with coordinate functions (x,y) = (z!,...,2*,y?,...,y”), is 
adapted to the G-action if 


(i) y(U) is a product open set U, x Uz C R* x R”, and 


(ii) each orbit intersects U either in the empty set or in a single slice of 
the form {y! =c',...,y" =c"}. 


We will show that for any p € M, there exists an adapted coordinate 
chart centered at p. To prove this, we begin by choosing any slice chart 
(W, w) centered at p for the orbit G-p in M. Write the coordinate functions 
of ~ as (ut,...,u*,v',...,v”), so that (G - p) NU is the slice {vt =--- = 
uv” = 0}. Let S be the submanifold of U defined by ul = --» = u® = 0. 
(This is the slice “perpendicular” to the orbit in these coordinates.) Thus 
T,M decomposes as the following direct sum: 


TpM =T»(G + p) ® TS, 


where T;,(G - p) is the span of (0/0u') and T,S' is the span of (0/0v"). 

Let w: G x S — M denote the restriction of the action 6 to Gx SC 
G x M. We will use the inverse function theorem to show that yw is a 
diffeomorphism in a neighborhood of (e,p) € Gx S. Let ip: G- Gx S 
be the embedding given by ip(g) = (g,p). The orbit map 0): G — M is 
equal to the composition 


CAE 6 aT 


Since 9) is an embedding whose image is the orbit G - p, it follows that 
gl?) (T-G) is equal to the subspace T,(G +p) C T,M, and thus the image of 
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Wx: Te,p)(G xX S) — TpM contains T,(G - p). Similarly, if j.: S - Gx S is 
the embedding j-(q) = (e,q), then the inclusion 1: S — M is equal to the 
composition 


Sty Ax gM: 


Therefore, the image of w, also includes T,S C T,M. Since T,(G - p) and 
T,S together span T,M, «: Tye,p)(G x S) — TpM is surjective, and for 
dimensional reasons, it is bijective. By the inverse function theorem, there 
exist a neighborhood (which we may assume to be a product neighborhood) 
X x Y of (e,p) in G x S and a neighborhood U of p in M such that 
w: X x Y — U is a diffeomorphism. Shrinking X and Y if necessary, we 
may assume that X and Y are precompact sets that are diffeomorphic to 
Euclidean balls in R* and R”, respectively. 

We need to show that Y can be chosen small enough that each G-orbit 
intersects Y in at most a single point. Suppose this is not true. Then if {Y;} 
is a countable neighborhood basis for Y at p (e.g., a sequence of Euclidean 
balls whose diameters decrease to 0), for each 7 there exist distinct points 
pi, p, € Y; that are in the same orbit, which is to say that g; - pj = p’, for 
some g; € G. Now, the points O(g;, p;) = (gi + pi, Di) = (5, pi) all lie in the 
compact set Y x Y, so by properness of 0, their inverse images (g;, p;) must 
lie in a compact set L C G x M. Thus the points g; all lie in the compact 
set ™g(L) C G. Passing to a subsequence, we may assume that g; > 9g € G. 
Note also that both sequences {p;} and {p;} converge to p, since p;, p;, € Y; 
and {Y;} is a neighborhood basis at p. By continuity, therefore, 


g- p= lim g- p; = lim p;, =p. 
I— CoO ICO 


Since G acts freely, this implies g = e. When 2 gets large enough, therefore, 
gi € X. But this contradicts the fact that @ is injective on X x Y, because 


99,(pi) = Di, = Oe(p4), 


and we are assuming p; # pj. 

Choose diffeomorphisms a: BY — X and 3: B” — Y (where B* and B” 
are the open unit balls in R* and R”, respectively), and define y: B* x 
B” — U by ¥(2,¥) = Paz) (G(y)). Because y is equal to the composition of 
diffeomorphisms 


BK x eel a gee gee Ba 


7 is a diffeomorphism. The map y = y~* is therefore a coordinate map on 
U. We will show that y is adapted to the G-action. Condition (i) is obvious 
from the definition. Observe that each y = constant slice is contained in a 
single orbit, because it is of the form 6(X x {po}) C O(G x {po}) = G+ po, 
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where po € Y is the point whose y-coordinate is the given constant. Thus if 
an arbitrary orbit intersects U, it does so in a union of y = constant slices. 
However, since an orbit can intersect Y at most once, and each y = constant 
slice has a point in Y, it follows that each orbit intersects U in precisely 
one slice if at all. This completes the proof that adapted coordinate charts 
exist. 

To finish the proof that M/G is locally Euclidean, let ¢ = m(p) be an 
arbitrary point of M/G, and let (U,y) be an adapted coordinate chart 
centered at p, with y(U) = U1 x Uz c R* x R™. Let V = x(U), which is 
an open subset of 1//G because 7 is an open map. Writing the coordinate 
functions of y as (x!,...,a*,y',...,y”) as before, let Y C U be the slice 
{gl =...=a* = 0}. Note that 7: Y — V is bijective by the definition of 
an adapted chart. Moreover, if W is an open subset of Y, then 


mW) = n({(x,y) : (0,4) € WH) 


is open in M/G, and thus z|y is a homeomorphism. Let o = (t|y)~!: V > 
Y CU, which is a local section of z. 

Define a map 7: V — U2 by sending the equivalence class of a point (x, y) 
to y; this is well defined by the definition of an adapted chart. Formally, 
7] = 720° poo, where 7: U, x Uz — U2 C R” is the projection onto the 
second factor. Because o is a homeomorphism from V to Y and mz 0y isa 
homeomorphism from Y to U2, it follows that 7 is a homeomorphism. This 
completes the proof that M/G is a topological n-manifold. 

Finally, we need to show that M/G has a smooth structure such that 
m is a submersion. We will use the atlas consisting of all charts (V,7) as 
constructed in the preceding paragraph. With respect to any such chart 
for M/G and the corresponding adapted chart for M, 7 has the coordinate 
representation m(2,y) = y, which is certainly a submersion. Thus we need 
only show that any two such charts for M//G are smoothly compatible. 

Let (U,y) and (U,) be two adapted charts for M, and let (V,7) and 
(V,7) be the corresponding charts for M//G. First consider the case in 
which the two adapted charts are both centered at the same point p € M. 
Writing the adapted coordinates as (x,y) and (%,y), the fact that the co- 
ordinates are adapted to the G-action means that two points with the 
same y-coordinate are in the same orbit, and therefore also have the same 
y-coordinate. This means that the transition map between these coordi- 
nates can be written (Z,y) = (A(z, y), B(y)), where A and B are smooth 
functions defined on some neighborhood of the origin. The transition map 
non} is just y = B(y), which is clearly smooth. 

In the general case, suppose (U, y) and (U ,) are adapted charts for M, 
and p € U, p € U are points such that m(p) = (p) = q. Modifying both 
charts by adding constant vectors, we can assume that they are centered at 
p and p, respectively. Since p and 7p are in the same orbit, there is a group 
element g such that g-p = p. Because 6, is a diffeomorphism taking orbits 
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to orbits, it follows that y’ = Yo @, is another adapted chart centered at 
p. Moreover, a’ = a5 o@ is the local section corresponding to ¢’, and 
therefore 7 = 120 G06! = 120G00,007' 06 =120 G00 =7. Thus we 
are back in the situation of the preceding paragraph, and the two charts 
are smoothly compatible. 


Covering Manifolds 


Proposition 2.8 showed that any covering space of a smooth manifold is 
again a smooth manifold. It is often important to know when a space 
covered by a smooth manifold is itself a smooth manifold. To understand 
the answer to this question, we need to study the covering group of a 
covering space. In this section, we assume knowledge of the basic properties 
of topological covering maps, as developed for example in [Lee00, Chapters 
11 and 12). 

Let M and M be topological spaces, and let 7: M—>M be a covering 
map. A covering transformation of 7 is a homeomorphism y: M — M such 
that Top=T: 


eS 2) pe 
M M 


oe ia 


The set C, (M ) of all covering transformations, called the covering group 
of 7, is a group under composition, acting on M on the left. The covering 
group is the key to constructing smooth manifolds covered by M. 

We will see below that for a smooth covering 7: M — M, the covering 
group acts smoothly, freely, and properly on the covering space M/. Before 
proceeding, it is useful to have an alternative characterization of properness 
for actions of discrete groups. 


Lemma 7.11. Suppose a discrete group I’ acts continuously on a topolog- 
ical manifold M. The action is proper if and only if the following condition 
holds: 


Any two points p,p' € M have neighborhoods U,U' such (7.3) 
that the set {g ET: (g-U)NUW FO} is finite. ; 


Proof. First suppose that [ acts properly, and let O: T x M—>MxM 
be the map O(g,p) = (g + p,p). Let U,U’ be precompact neighborhoods 
of p and p’, respectively. If (7.3) does not hold, then there exist infinitely 
many distinct elements g; € I and points pj € U such that g; - p; € U’. 
Because the pairs (g; « pi, pi) = O(gi, pi) lie in the compact set U’ x U, the 
preimages (g;,p;) lie in a compact subset of T x M , and therefore have a 
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convergent subsequence. But this is impossible, because {g;} is an infinite 
sequence of distinct points in a discrete space. oh ks 

Conversely, suppose (7.3) holds. If Z is any compact subset of M x M, 
we need to show that O~1(L) C TI x M is compact. It suffices to show 
that any sequence {(g;,p;)} C O~1(L) has a convergent subsequence. Thus 
suppose O(g;, pi) = (gi - pi, pi) € L for all i. By compactness of L, we can 
replace this sequence by a subsequence such that pj — p and g; + pj — p’. 
Let U, U’ be neighborhoods of p and p’, respectively, satisfying property 
(7.3). For all sufficiently large i, p; € U and g; - p; € U’. Since there are 
only finitely many g € T for which (g-U) NU’ # @, this means that there 
is some g € TI such that g; = g for infinitely many 7; in particular, some 
subsequence of (g;,p;) converges. 


Exercise 7.4. Suppose [is a discrete group acting continuously on a topo- 
logical manifold M7. Show that the action is proper if and only if both of the 
following conditions are satisfied: 


(i) Each pé€ M has a neighborhood U such that (g-U)QNU =@ for all 
but finitely many g € [. 


(ii) Ifp,p’ € M are not in the same T-orbit, there exist neighborhoods U 
of p and U’ of p’ such that (g- U) NU’ =@ for all g ET. 


A continuous discrete group action satisfying conditions (i) and (ii) of the 
preceding exercise (or condition (7.3) of Lemma 7.11, or something closely 
related to these) has traditionally been called properly discontinuous. Be- 
cause the term “properly discontinuous” is self-contradictory (properly dis- 
continuous group actions are, after all, continuous!), and because there is 
no general agreement about exactly what the term should mean, we will 
avoid using this terminology and stick with the more general term “proper 
action” in this book. 


Proposition 7.12. Let 7: M—3M be_a smooth covering map. With the 
discrete topology, the covering group C,(M) is a zero-dimensional Lie group 


acting smoothly, freely, and properly on M. 

Proof. To show that C, (M ) is a Lie group, we need only verify that it is 
countable. Let j € M be arbitrary and let p = 1(p). Because the fiber 
nm 1(p) is a discrete subset of the manifold M, it is countable. Since each 
element of €,(M/) is uniquely determined by what it does to p [Lee00, 
Proposition 11.27(a)], the map y+ y(p) is an injection of I into 7~'(p); 
thus [ is countable. 

Smoothness of the action follows from the fact that any covering trans- 
formation y can be written locally as y = ao 7 for a suitable smooth local 
section o. The action is free because the only covering transformation that 
fixes any point is the identity. 
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To show that the action is proper, we will show that it satisfies condi- 
tions (i) and (ii) of Exercise 7.4. If p € M, let U be an evenly covered 
neighborhood of x(p), and let U be the component of x~!(U) containing 
p. Because each element of the covering group permutes the components 
of x~!(U) [Lee00, Proposition 11.27(d)], it follows that U satisfies (i). (In 
fact, it satisfies the stronger condition that (g-U)NU = @ for allg eG 
except g = e.) 

Let p,p’ € M be points in separate orbits. If m(p) 4 z(p’), then 
there are disjoint open sets U containing a(p) and U’ containing a(p’), 
so 7~'(U),a~1(U’) are disjoint open sets satisfying (ii). If m(p) = m(p'), 
let U be an evenly covered Heiehbortood of x(p), and let U, U’ be the 
components of 7~!(U) containing pand p’, respectively. If y is a covering 
transformation such that g(U)NU’ 4 @, then e(O ) = U' because covering 
transformations permute the components of 7~!(U); therefore, since each 
component contains exactly one point of 7— ‘(p ), it follows that y(p) = p’, 
which contradicts the assumption that p and p’ are in different orbits. Thus 
U and U’ satisfy (ii). 


The quotient manifold theorem yields the following converse to this 
proposition. 


Theorem 7.13. Suppose M is a connected smooth manifold, and a_dis- 
crete Lie group T acts smoothly, freely, and properly on M. Then M/T 
is a_topological manifold and has a unique smooth structure such that 
tw: M — M/T is a smooth covering map. 


Proof. It follows from the quotient manifold theorem that M /T has a 
unique smooth manifold structure such that 7 is a smooth submersion. 
Because dim M M/ T =dimM-—dimI = dim M, this implies that 7 is a local 
diffeomorphism. On the other hand, it follows from the theory of covering 
spaces [Lee00, Corollary 12.12] that 7 is a topological covering map. Thus 
m is a smooth covering map. Uniqueness of the smooth structure follows 
from the uniqueness assertion of the quotient manifold theorem, because a 
smooth covering map is in particular a submersion. 


Example 7.14 (Proper Discrete Group Actions). 


(a) The discrete Lie group Z” acts smoothly and freely on R” by trans- 
lation (Example 7.3(g)). To check that the action is proper, one 
can verify that condition (7.3) is satisfied by sufficiently small balls 
around p and p’. The quotient manifold R"/Z” is homeomorphic to 
the n-torus T”, and Theorem 7.13 says that there is a unique smooth 
structure on T” making the quotient map into a smooth covering 
map. To verify that this smooth structure on T” is the same as the 
one we defined previously (thinking of T” as the product manifold 
S! x --- x S), we just check that the covering map R" — T” given 
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by (v1,...,2") 1 (e27@" |, e?#@") ig a local diffeomorphism with 
respect to the product smooth structure on T”, and apply the result 
of Proposition 5.21. 


(b) The two-element group {+1} acts on S” by multiplication. This action 
is obviously smooth and free, and it is proper because the group is 
compact. This defines a smooth structure on S”/{+1}. In fact, this 
quotient manifold is diffeomorphic to P” with the smooth structure 
we defined in Chapter 1, which can be seen as follows. Consider the 
map 7’: S” — P” defined as the composition of the inclusion 1: S” << 

R"*+1 \ {0} followed by the projection 7: R"t! \ {0}  P” defining 

P”. This is a smooth covering map (see Problem 7-1), and makes the 

same identifications as 7. By the result of Problem 5.21, S”/{+1} is 

diffeomorphic to P”. 


Quotients of Lie Groups 


Another important application of the quotient manifold theorem is to the 
study quotients of Lie groups by Lie subgroups. Let G be a Lie group and 
let H C G be a Lie subgroup. If we let H act on G by right translation, 
then an element of the orbit space is the orbit of an element g € G, which 
is a set of the form gH = {gh: h © H}. In other words, an orbit under 
the right action by H is a left coset of H. We will use the notation G/H to 
denote the orbit space by this right action (the left coset space). 


Theorem 7.15. Let G be a Lie group and let H be a closed Lie subgroup 
of G. The action of H on G by right translation is smooth, free, and proper. 
Therefore the left coset space G/H is a smooth manifold, and the quotient 
map 7: G— G/H is a smooth submersion. 


Proof. We already observed in Example 7.3(f) that H acts smoothly and 
freely on G. To see that the action is proper, let O0:Gx H—~GxG 
be the map O(g,h) = (gh,g), and suppose L Cc G x G is a compact set. 
If {(gi,hi)} is a sequence in @~1(L), then passing to a subsequence if 
necessary we may assume that the sequences {g;h;} and {g;} converge. By 
continuity, therefore, h; = g; '(gihi) converges to a point in G, and since 
FH is closed in G it follows that {(g;, h:)} converges in G x H. 


A discrete subgroup of a Lie group is a subgroup that is a discrete set in 
the subspace topology (and is thus an embedded zero-dimensional Lie sub- 
group). The following corollary is an immediate consequence of Theorems 
7.13 and 7.15. 


Corollary 7.16. Let G be a Lie group, and let T C G be a discrete sub- 
group. Then the quotient map 7: G — G/T is a smooth covering map. 
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Example 7.17. Let C be the unit cube centered at the origin in R?. The 
set [ of positive-determinant orthogonal transformations of R? that take 
C to itself is a finite subgroup of SO(3), and the quotient SO(3)/T is a 
connected smooth 3-manifold whose universal cover is S? (see Problem 7- 
13). Similar examples are obtained from the symmetry group of any regular 
polyhedron, such as a regular tetrahedron, dodecahedron, or icosahedron. 


Homogeneous Spaces 


One of the most interesting kinds of group action is that in which a group 
acts transitively. A smooth manifold endowed with a transitive smooth 
action by a Lie group G is called a homogeneous G-space, or a homogeneous 
space or homogeneous manifold if it is not important to specify the group. 

In most examples, the group action preserves some property of the man- 
ifold (such as distances in some metric, or a class of curves such as straight 
lines in the plane); then the fact that the action is transitive means that 
the manifold “looks the same” everywhere from the point of view of this 
property. Often, homogeneous spaces are models for various kinds of geo- 
metric structures, and as such they play a central role in many areas of 
differential geometry. 

Here are some important examples of homogeneous spaces. 


Example 7.18 (Homogeneous Spaces). 


(a) The natural action of O(n) on S"~! is transitive, as we observed in 
Example 7.3. So is the natural action of SO(n) on S"~! when n > 2. 
Thus for n > 2,S"~1 is a homogeneous space of either O(n) or SO(n). 


(b) Let E(n) denote the subgroup of GL(n + 1, R) consisting of matrices 


of the form 
A b 7 
re 1) Aco, ber’ h, 


where b is considered as an n x 1 column matrix. It is straightforward 
to check that E(n) is an embedded Lie subgroup. If $ c Rt! denotes 
the affine subspace defined by x”+! = 1, then a simple computation 
shows that E(n) takes S to itself. Identifying S with R” in the obvious 
way, this induces an action of E(n) on R”, in which the matrix (4 °) 
sends x to Ax + b. It is not hard to prove that these are precisely 
the transformations that preserve the Euclidean inner product (see 
Problem 7-17). For this reason, E(n) is called the Euclidean group. 
Because any point in R” can be taken to any other by a translation, 
E(n) acts transitively on R", so R” is a homogeneous E(n)-space. 
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(c) The group SL(2,R) acts smoothly and transitively on the upper half- 
plane H = {z € C: Imz > 0} by the formula 


a b az+b 
-_Zz= ——. 
c d cz+d 
The resulting complex-analytic transformations of H are called 
Mobius transformations. 


(d) If G is any Lie group and H is a closed Lie subgroup, the space G/H 
of left cosets is a smooth manifold by Theorem 7.15. We define a left 
action of G on G/H by 


gi + (92H) = (9192) H. 


This action is obviously transitive, and Proposition 5.20 implies that 
it is smooth. 


Exercise 7.5. Show that both U(n) and SU(n) act smoothly and transi- 
tively on S?”~1, thought of as the set of unit vectors in C”. 


Example 7.18(d) above turns out to be of central importance because, 
as the next theorem shows, every homogeneous space is equivalent to one 
of this type. 


Theorem 7.19 (Characterization of Homogeneous Spaces). Let 
M be a homogeneous G-space, and let p be any point of M. Then 
the isotropy group Gy is a closed Lie subgroup of G, and the map 
F: G/G, — M defined by F(gGp) = 9+ p is an equivariant diffeomor- 
phism. 


Proof. For simplicity, let us write H = Gy. First we will show that H is 
a closed Lie subgroup. Define a map ®: G > M by ®(g) = g- p. This is 
obviously smooth, and H = ®~!(p). Observe that 


®(9'9) = (9'9) -p=9'-(g-p)=9' - B49), 


so ® is equivariant with respect to the action by G on itself by left multipli- 
cation and the given G-action on M. This implies that H is an embedded 
submanifold of G and therefore a closed Lie subgroup. 

To see that F' is well defined, assume that g1H = go2H, which means that 
91, 92 € H. Writing g7'g2 = h, we see that 


F(g2.H) = 92-p=gih-p=g1:-p=F(mH). 


Also, F is equivariant, because 


F(g'gH) = (9'9) -p=9' - F(gH). 
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It is smooth because it is obtained from ® by passing to the quotient (see 
Proposition 5.20). 

Next we show that F is bijective. Given any point q € M there is a group 
element g € G such that F(gH) = g+-p = q by transitivity. On the other 
hand, if F(giH) = F(g2H), then gi - p = g2- p implies g; ‘go + p = p, so 
9,92 € H, which implies gH = goH. 

Because F is a bijective smooth map of constant rank, it is a diffeomor- 
phism by Proposition 6.5. 


This theorem shows that the study of homogeneous spaces can be reduced 
to the largely algebraic problem of understanding closed Lie subgroups of 
Lie groups. Because of this, some authors define a homogeneous space to 
be a quotient manifold of the form G/H, where G is a Lie group and H is 
a closed Lie subgroup of G. 

Applying this theorem to the examples of transitive group actions we 
developed earlier, we see that some familiar spaces can be expressed as 
quotients of Lie groups by closed Lie subgroups. 


Example 7.20 (Homogeneous Spaces Revisited). 


(a) Consider again the natural action of O(n) on S"~!. If we choose 
our base point in S”~! to be the “north pole” N = (0,...,0,1), 
it is easy to check that the isotropy group is O(n — 1), thought of 
as orthogonal transformations of R” that fix the last variable. Thus 
S”"—! is diffeomorphic to the quotient manifold O(n)/O(n — 1). For 
the action of SO(n) on S"~!, the isotropy group is SO(n—1), so S"-+ 
is also diffeomorphic to SO(n)/ SO(n — 1). 


(b) Similarly, using the result of Exercise 7.5, we conclude that S?”~! = 
U(n)/U(n — 1) & SU(n)/SU(n — 1). 


(c) Because the Euclidean group E(n) acts smoothly and transitively on 
R”, and the isotropy group of the origin is the subgroup O(n) C E(n) 
(identified with the (n+ 1) x (n+ 1) matrices of the form (#9) with 
A € O(n)), R” is diffeomorphic to E(n)/ O(n). 


Application: Sets with Transitive Group Actions 


A highly useful application of the characterization theorem is to put smooth 
structures on sets that admit transitive Lie group actions. 


Proposition 7.21. Suppose X is a set, and we are given a transitive ac- 
tion of a Lie group G on X, such that the isotropy group of a point p © X 
is a closed Lie subgroup of G. Then X has a unique manifold topology and 
smooth structure such that the given action is smooth. 
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Proof. Let H denote the isotropy group of p, so that G/H is a smooth 
manifold by Theorem 7.15. The map fF’: G/H — X defined by F(gH) = 
g-p is an equivariant bijection by exactly the same argument as we used in 
the proof of the characterization theorem. (That part did not use the fact 
that M was a manifold at all.) If we define a topology and smooth structure 
on X by declaring F' to be a diffeomorphism, then the given action of G 
on X is smooth because it can be written (g,2) > F(g- F7(a)). 

If X denotes the set X with any smooth manifold structure such that 
the given action is smooth, then by the homogeneous space characterization 
theorem, X is equivariantly diffeomorphic to G/H and therefore to X, so 
the topology and smooth structure are unique. 


Example 7.22 (Grassmannians). Let G(k,n) denote the set of k- 
dimensional subspaces of R” as in Example 1.15. The general linear group 
GL(n,R) acts transitively on G(k,n): Given two subspaces A and A’, 
choose bases for both subspaces and extend them to bases for R”, and 
then the linear transformation taking the first basis to the second also 
takes A to A’. The isotropy group of the subspace R* C R” is 


a 


>) 


H={(4 > :A€GL(k,R), DE GL(n—k,R 


BeM(kx (n—w),B)}, 


which is a closed Lie subgroup of GL(n, R). Therefore G(k,n) has a unique 
smooth manifold structure making the natural GL(n,R) action smooth. 
Problem 7-19 shows that this is the same smooth structure we defined in 
Example 1.15. 


Example 7.23 (Flag Manifolds). Let V be a real vector space of di- 


mension n > 1, and let K = (ki,...,km) be a finite sequence of integers 
satisfying 0 < ky <--- < km <n. A flag in V of type K is a sequence 
of linear subspaces S$; C Sg C--: C Sm C V, with dim S; = k; for each 


i. The set of all flags of type K in V is denoted Fx (V). (For example, 
if K = (k), then Fx(V) is the Grassmannian G;(V).) It is not hard to 
show that GL(V) acts transitively on Fx (V) with a closed Lie subgroup as 
isotropy group (see Problem 7-23), so Fx (V) has a unique smooth manifold 
structure making it into a homogeneous GL(V)-space. With this structure, 
Fx(V) is called a flag manifold. 


Application: Connectivity of Lie Groups 


Another application of homogeneous space theory is to identify the con- 
nected components of many familiar Lie groups. The key result is the fol- 
lowing proposition. 
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Proposition 7.24. Suppose a Lie group G acts smoothly, freely, and prop- 
erly on a manifold M. If G and M/G are connected, then M is connected. 


Proof. Suppose M is not connected. This means that there are nonempty, 
disjoint open sets U,V Cc M whose union is M. Because the quotient 
map 7: M — M/G is an open map (Proposition 5.18), 7(U) and 7(V) 
are nonempty open subsets of M/G. If 7(U) N z(V) # @, there is a G- 
orbit that contains points of both U and V. However, each orbit is an 
embedded submanifold diffeomorphic to G, which is connected, so each 
orbit lies entirely in one of the sets U or V. Thus {n(U),a(V)} would 
be a separation of M/G, which contradicts the assumption that M/G is 
connected. 


Proposition 7.25. For any n, the Lie groups SO(n), U(n), and SU(n) 
are connected. The group O(n) has exactly two components, one of which 


is SO(n). 


Proof. We begin by proving that SO(n) is connected by induction on n. 
For n = 1 this is obvious, because SO(1) is the trivial group. Now suppose 
we have shown that SO(n — 1) is connected for some n > 2. Because the 
homogeneous space SO(n) / SO(n—1) is diffeomorphic to S”~! and therefore 
is connected, Proposition 7.24 and the induction hypothesis imply that 
SO(n) is connected. A similar argument applies to U(n) and SU(n), using 
the facts that U(n)/U(n — 1) » SU(n)/SU(n — 1) = S?7-1. 

Note that O(n) is equal to the union of the two open sets O*(n) and 
O(n) consisting of orthogonal matrices whose determinant is +1 or —1, 
respectively. As we noted earlier, O(n) = SO(n), which is connected. On 
the other hand, if A is any orthogonal matrix whose determinant is —1, then 
left translation L 4 is a diffeomorphism from Ot (n) to O7 (n), so O7 (n) is 
connected as well. Therefore {OT (n),O7 (n)} are exactly the components 
of O(n). 


Determining the components of the general linear groups is a bit more 
involved. Let GLt(n,R) and GL7~(n,R) denote the subsets of GL(n,R) 
consisting of matrices with positive determinant and negative determinant, 
respectively. 


Proposition 7.26. The components of GL(n,R) are GL*(n,R) and 
GL (n,R). 


Proof. We begin by showing that GL*(n,R) is connected. It suffices to 
show that it is path connected, which will follow once we show that there 
is a continuous path in GL*(n,R) from any A € GL*(n,R) to the identity 
matrix In. 

Let A € GL*(n,R) be arbitrary, and let (Aj,...,4n) denote the 
columns of A, considered as vectors in R”. The Gram-Schmidt algorithm 
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(Proposition A.17 in the Appendix) shows that there is an orthonormal 
basis (Qi,..-,Qn) for R” with the property that span(Qi,...,Qz) = 
span(A,,...,A,) for each k = 1,...,n. Thus we can write 


Ay — RiQi, 


Ay = R3Q1 + R3Qo, 


An = RiQi+ R2Qo+---+ RPQn, 


for some constants R}. Replacing each Q; by —Q; if necessary, we may 
assume that Rt > 0 for each i. In matrix notation, this is equivalent to 
A= QR, where R is upper triangular with positive entries on the diagonal. 
Since the determinant of FR is the product of its diagonal entries and det A = 
(det Q)(det R) > 0, it follows that Q € SO(n). (This QR decomposition 
plays an important role in numerical linear algebra.) 

Let R; = tl, +(1—-#)R. It is immediate that R; is upper triangular with 
positive diagonal entries for all t € [0,1], so R, € GL*(n,R). Therefore, 
the path y: [0,1] ~ GLt(n,R) given by y(t) = QR; satisfies y(0) = A 
and y(1) = Q € SO(n). Because SO(n) is connected, there is a path in 
SO(n) from Q to the identity matrix. This shows that GL*(n,R) is path 
connected. 

Now, as in the case of O(n), any matrix B with det B < 0 yields a 
diffeomorphism Lg: GLt(n,R) —- GL7(n,R), so GL7(n,R) is connected 
as well. This completes the proof. 
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Problems 


7-1. 


7-2. 


7-3. 


7-6. 


7-7. 


7-8. 


7-9. 


7-10. 


Let 7: S” — P” be the map that sends x € S” to the line through the 
origin and x, thought of as a point in P”. Show that a is a smooth 
covering map. 


Define a map F:: P? > R* by 
F(a, y, z] = (2? — y*, zy, 22, yz). 
Show that F is a smooth embedding. 
Let G be a Lie group and H C G aclosed normal Lie subgroup. Show 


that G/H is a Lie group and the quotient map 7: G — G/H is a Lie 
homomorphism. 


. If F: G— GF is a surjective Lie group homomorphism, show that H 


is Lie isomorphic to G/ Ker F’. 


. Let G be a connected Lie group, and suppose fF’: G — H is a surjec- 


tive Lie group homomorphism with discrete kernel. Show that F is a 
smooth covering map. 


Let G be a Lie group, and suppose 7: G= Cis any covering map. 
For any point € € ~!(e), show that G has a unique Lie group struc- 
ture such that € is the identity element of G and is a Lie group 
homomorphism. 


(a) Show that there exists a Lie group homomorphism p : S' > U(n) 
such that det op = Idg.. 


(b) Show that U(n) is diffeomorphic to S' x SU(n). [Hint: Consider 
the map y: S! x SU(n) — U(n) given by y(z, A) = p(z) A] 


(c) Show that U(n) and St x SU(n) are not isomorphic Lie groups. 
Show that SU(2) is diffeomorphic to S°. 


Let G be a Lie group, and let Gop denote the connected component 
of the identity (called the identity component of G). 


(a) Show that Go is an embedded Lie subgroup of G, and that each 
connected component of G is diffeomorphic to Go. 


(b) If H is any connected open subgroup of G, show that H = Go. 
Suppose a Lie group acts smoothly on a manifold M. 


(a) Show that each orbit is an immersed submanifold of M. 
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(b) Give an example of a Lie group acting smoothly on a mani- 


fold M in which two different orbits have different dimensions 
even though neither orbit has dimension equal to zero or to the 
dimension of M. 


7-11. Prove the following partial converse to the quotient manifold theorem: 
If a Lie group G acts smoothly and freely on a smooth manifold 
and the orbit space //G has a smooth manifold structure such that 
the quotient map 7: M — M/G' is asmooth submersion, then G acts 
properly. 


7-12. Give an example of a smooth, proper action of a Lie group on a 
smooth manifold such that the orbit space is not a topological man- 


ifold. 


7-13. Prove that SO(3) is Lie isomorphic to SU(2)/{+J} and diffeomorphic 
to P’, as follows. 


(a) Let H denote the set of 2 x 2 Hermitian matrices whose trace is 


ws 


zero. (The trace of a matrix is the sum of its diagonal entries.) 
Show that H is a 3-dimensional vector space over R, and 


1 0 01 0 i 
m=( A) BG) (Ga) 


is a basis for H. 


If we give H the inner product for which (Fj), Eo, F3) is an or- 
thonormal basis, show that |A|? = — det A for all A € K. 


Identifying GL(3, R) with the set of invertible real-linear maps 
H — H by means of the basis (£), Eo, E3), define a map 
p: SU(2) > GL(3, R) by 


p(X)A= XAX"}, X €SU(2), AEX. 


Show that p is a Lie group homomorphism whose image is SO(3) 
and whose kernel is {+}. [Hint: To show that the image is all 
of SO(3), show that p is open and closed and use the results of 
Problem 7-9.] 


(d) Prove the result. 


7-14. Determine which of the following Lie groups are compact: GL(n,R), 
SL(n, R), GL(n, C), SL(n, C), U(n), SU(n). 


7-15. Show that GL(n, C) is connected. 


7-16. Show that SL(n,R) and SL(n,C) are connected. 


7-17. 


7-18. 
7-19. 


7-20. 
7-21. 


7-22. 


7-23. 


7-24. 


7-25. 
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Prove that the set of maps from R” to itself given by the action of 
E(n) on R” described in Example 7.18(b) is exactly the set. of all 
maps from R” to itself that preserve the Euclidean inner product. 


Prove that the Grassmannian G(k,) is compact for any k and n. 


Show that the smooth structure on the Grassmannian G(k, n) defined 
in Example 7.22 is the same as the one defined in Example 1.15. 


Show that the image of a Lie group homomorphism is a Lie subgroup. 


(a) Let Gand H be Lie groups. Suppose p: H x G > G is a smooth 
left action of H on G with the property that p,: G — G isa Lie 
group homomorphism for every h € H. Define a group structure 
on the manifold G x H by 


(g,h)(g',h’) = (gen(g’), hh’). 
Show that this turns Gx H into a Lie group, called the semidirect 
product of G and H induced by p, and denoted by G x, H. 
(b) If G is any Lie group, show that G is Lie isomorphic to a semi- 


direct product of a connected Lie group with a discrete group. 


Define an action of Z on R? by 


n+ (x,y) = (@+n,(-1)"y). 


(a) Show that the action is smooth, free and proper. Let E = R?/Z 
denote the quotient manifold. 


(b) Show that the projection on the first coordinate 7,: R? — R 
descends to a smooth map 7: E — S!. 


(c) Show that EF is a rank-1 vector bundle over St with projection 
m. (It is called the Mobius bundle.) 


(d) Show that F is not a trivial bundle. 


Let Fx (V) be the set of flags of type K in a finite-dimensional vector 
space V as in Example 7.23. Show that GL(V) acts transitively on 
Fx(V), and that the isotropy group of a particular flag is a closed 
Lie subgroup of GL(V). 


The n-dimensional complex projective space, denoted by CP”, is the 
set of 1-dimensional complex subspaces of C"+!. Show that CP” 
has a unique topology and smooth structure making it into a 2n- 
dimensional compact manifold and a homogeneous space of U(n). 


Show that CP! is diffeomorphic to S?. 
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7-26. 


7-27. 
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Considering S?”+! as the unit sphere in C”+!, define an action of S! 
on S2"+1 by 


pat casa = ur ee ls 

Show that this action is smooth, free, and proper, and that the orbit 
space S2”*!/§1 is diffeomorphic to CP”. [Hint: Consider the restric- 
tion of the natural quotient map C”*! \ {0} > CP” to S?"t!. The 


quotient map 7: S?”+! — CP” is known as the Hopf map.] 


Let ¢ be an irrational number, and let R act on T? = S! x S! by 
t- (w, 2) = (er er 2), 


Show that this is a smooth free action, but the quotient T?/R is not 
Hausdorff. 
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‘Tensors 


Much of the machinery of smooth manifold theory is designed to allow the 
concepts of linear algebra to be applied to smooth manifolds. For example, 
a tangent vector can be thought of as a linear approximation to a curve; the 
tangent space to a submanifold can be thought of as a linear approximation 
to the submanifold; and the push-forward of a smooth map can be thought 
of as a linear approximation to the map itself. Calculus tells us how to 
approximate smooth objects by linear ones, and the abstract definitions of 
manifold theory give a way to interpret these linear approximations in a 
coordinate-invariant way. In this chapter, we carry this idea much further, 
by generalizing from linear objects to multlinear ones. This leads to the 
concepts of tensors and tensor fields on manifolds. 

We begin with tensors on a vector space, which are multilinear general- 
izations of covectors; a covector is the special case of a tensor of rank one. 
We give two alternative definitions of tensors on a vector space: On the 
one hand, they are real-valued multilinear functions of several vectors; on 
the other hand, they are elements of the abstract “tensor product” of the 
dual vector space with itself. Each definition is useful in certain contexts. 
We then discuss the difference between covariant and contravariant tensors, 
and give a brief introduction to tensors of mixed variance. 

We then move to smooth manifolds, and define tensors, tensor fields, 
and tensor bundles. After describing the coordinate representations of ten- 
sor fields, we describe how they can be pulled back by smooth maps. We 
introduce a special class of tensors, the symmetric ones, whose values are 
unchanged by permutations of their arguments. 
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The last section of the chapter is an introduction to one of the most 
important kinds of tensor fields, Riemannian metrics. A thorough treatment 
of Riemannian geometry is beyond the scope of this book, but we can at 
least lay the groundwork by giving the basic definitions and proving that 
every manifold admits Riemannian metrics. 


The Algebra of Tensors 


Suppose V,..., Vz and W are vector spaces. A map F’: Vj x---x Vz — W is 
said to be multilinear if it is linear as a function of each variable separately: 


F(v1,...,av; +.a’v},..., vx) 
=aF (v4,...,Vi,;-.-,Uk) +a’ F(v1,...,U;,---, Up): 


(A multilinear function of two variables is generally called bilinear.) Al- 

though linear maps are paramount in differential geometry, there are many 

situations in which multilinear maps play an important geometric role. 
Here are a few examples to keep in mind: 


e The dot product in R” is a scalar-valued bilinear function of two vec- 
tors, used to compute lengths of vectors and angles between them. 


e The cross product in R® is a vector-valued bilinear function of two 
vectors, used to compute areas of parallelograms and to find a third 
vector orthogonal to two given ones. 


e The determinant is a real-valued multilinear function of n vectors in 
R”, used to detect linear independence and to compute the volume 
of the parallelepiped spanned by the vectors. 


In this section, we will develop a unified language for talking about mul- 
tilinear functions—the language of tensors. In a little while, we will give a 
very general and abstract definition of tensors. But it will help to clarify 
matters if we start with a more concrete definition. 

Let V be a finite-dimensional real vector space, and k a natural number. 
(Many of the concepts we will introduce in this section—at least the parts 
that do not refer explicitly to finite bases—work equally well in the infinite- 
dimensional case; but we will restrict our attention to the finite-dimensional 
case in order to keep things simple.) 

A covariant k-tensor on V is a real-valued multilinear function of k ele- 
ments of V: 


T: Vx-->xVorR. 
~ -— 


k, copies 
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The number & is called the rank of T’. A 0-tensor is, by convention, just a 
real number (a real-valued function depending multilinearly on no vectors!). 
The set of all covariant k-tensors on V, denoted by T*(V), is a vector space 
under the usual operations of pointwise addition and scalar multiplication: 


(aT)(Xq,..., Xx) =a(T(X1,...,Xz)), 
Gea ini: HUE: 2 Eo Connie Come AL NS, C2 
Let us look at some examples. 
Example 8.1 (Covariant Tensors). 


(a) Every linear map w: V — R is multilinear, so a covariant 1-tensor is 
just a covector. Thus T1(V) is naturally identified with V*. 


(b) A covariant 2-tensor on V is a real-valued bilinear function of two 
vectors, also called a bilinear form. One example is the dot product 
on R”. More generally, any inner product on V is a covariant 2-tensor. 


(c) The determinant, thought of as a function of n vectors, is a covariant 
n-tensor on R”. 


(d) Suppose w,7 € V*. Define a map w @n: V x V > R by 
w (X,Y) = w(X)n(Y), 


where the product on the right is just ordinary multiplication of real 
numbers. The linearity of w and 7 guarantees that w © 7 is a bilinear 
function of X and Y, i.e., a 2-tensor. 


The last example can be generalized to tensors of any rank as follows. Let 
V be a finite-dimensional real vector space and let $ € T*(V), T € T'(V). 
Define a map 


S@T:Vx---xV—-R 
—~ _ 


k+l copies 


by 
S® T(X1, sss »Xk41) = S(X1, spe > Xk)T(Xkq1, sth jana) 


It is immediate from the multilinearity of S and T that S @ T depends 
linearly on each argument X; separately, so it is a covariant (k + 1)-tensor, 
called the tensor product of S' and T. 


Exercise 8.1. Show that the tensor product operation is bilinear and as- 
sociative. More precisely, show that S @ T depends linearly on each of the 
tensors S and T, and that (R®S)®T=R®@(S@T). 


174 8. Tensors 


Because of the result of the preceding exercise, we can write the tensor 
product of three or more tensors unambiguously without parentheses. If 
T,,..., 7, are tensors of ranks kj,...,k; respectively, their tensor product 
T, ®---@® Tj is a tensor of rank k = ky +---+k,, whose action on k vectors 
is given by inserting the first k, vectors into T;, the next kg vectors into 
Ty, and so forth, and multiplying the results together. For example, if R 
and S' are 2-tensors and T is a 3-tensor, then 


R®S@T(X,...,X7) = R(X1, X2)S(X3, X4)T(X5, X6, X7). 


Proposition 8.2. Let V be a real vector space of dimension n, let (E;) be 
any basis for V, and let (e*) be the dual basis. The set of all k-tensors of 
the form e2 @---@e* for 1<i,,...,in <n is a basis for T*(V), which 
therefore has dimension n*. 

Proof. Let B denote the set {67 @+--@e :1<i4,...,ip <n}. We need 
to show that B is independent and spans T*(V). Suppose T € T*(V) is 
arbitrary. For any k-tuple (i1,...,%,) of integers such that 1 < i; < n, 
define a number Tj, ..;, by 


We will show that 
= Poe @++-@e* 


(with the summation convention in effect as usual), from which it follows 
that B spans V. We compute 


Tae Qe @ er (Ey, oes pity) = Ty age? (E; ) a gtk (E; ) 
= Ti, 55, + OF 
= Tye 


27 (FBy 25), Ej,): 


By multilinearity, a tensor is determined by its action on sequences of basis 
vectors, so this proves the claim. 

To show that B is independent, suppose some linear combination equals 
Zero: 


Tae" @-:++®@ gtk = 0. 


Apply this to any sequence (E;,,...,;,) of basis vectors. By the same 
computation as above, this implies that each coefficient T;,_..;, is zero. 
Thus the only linear combination of elements of B that sums to zero is the 
trivial one. 
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The proof of this proposition shows, by the way, that the components 
T;,...;, of a tensor T in terms of the basis tensors in B are given by (8.1). 

It is useful to see explicitly what this proposition means for tensors of 
low rank. 


e k=0: T°(V) is just R, so dimT°(V) =1=7n°. 
e k=1:T1(V) = V* has dimension n = n!. 


e k = 2: T?(V) is the space of bilinear forms on V. Any bilinear form 
can be written uniquely as T = T;;¢’ @€7, where (Tj;) is an arbitrary 
n x n matrix. Thus dim T?(V) = n?. 


Abstract Tensor Products of Vector Spaces 


Because every covariant k-tensor can be written as a linear combination of 
tensor products of covectors, it is suggestive to write 


T*(V) =V*®---@V*, 


where we think of the expression on the right-hand side as a shorthand for 
the set of all linear combinations of tensor products of elements of V*. 

We will now give a construction that makes sense of this notation in a 
much more general setting. The construction is a bit involved, but the idea 
is simple: Given vector spaces V and W, we will construct a vector space 
V @W that consists of linear combinations of objects of the form v © w for 
v € V, w € W, defined in such a way that v © w depends bilinearly on v 
and w. 

Let S be a set. The free vector space on S, denoted R(S), is the set of 
all finite formal linear combinations of elements of S$ with real coefficients. 
More precisely, a finite formal linear combination is a function F: S — R 
such that ¥(s) = 0 for all but finitely many s € S. Under pointwise addition 
and scalar multiplication, R(S') becomes a real vector space. Identifying 
each element x € S with the function that takes the value 1 on x and zero 
on all other elements of S, any element F € R(S) can be written uniquely 
in the form F = ae a;x;, where £1,...,%m are the elements of S for 
which F(x;) # 0, and a; = F(x;). Thus S' is a basis for R(S'), which is 
therefore finite-dimensional if and only if S is a finite set. 


Exercise 8.2 (Characteristic Property of Free Vector Spaces). 
Let S be a set and W a vector space. Show that any map F': S — W has a 
unique extension to a linear map F: R(S) — W. 


Now let V and W be finite-dimensional real vector spaces, and let R be 
the subspace of the free vector space R(V x W) spanned by all elements of 
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the following forms: 


cat (8.2) 
( 


(v, w) A 


fora € R, v,v’ € V, and w, w’ € W. Define the tensor product of V and W, 
denoted by V @W, to be the quotient space R(V x W)/R. The equivalence 
class of an element (v,w) in V ® W is denoted by v ® w, and is called the 
tensor product of v and w. From the definition, tensor products satisfy 


a(v @w) =av@w=v au, 
v@uwtv @w=(vt+v)@u, 
v@wtvew =v@(wt+w’). 
Note that the definition implies that every element of V@W can be written 


as a linear combination of elements of the form v @ w for v € V, w € W; 
but it is not true in general that every element of V @ W is of this form. 


Proposition 8.3 (Characteristic Property of Tensor Products). 
Let V and W be finite-dimensional real vector spaces. If A: V x W > Y 
is a bilinear map into any vector space Y, there is a unique linear map 
A: V@W —Y such that the following diagram commutes: 


vxw—‘4ty 
7 A 
VOW, (8.3) 


where m(v,w) =v@w. 


Proof. First note that any map A: V x W — X extends uniquely to a 
linear map A: R(V x W) — X by the characteristic property of the free 
vector space. This map is characterized by the fact that A(v,w) = A(v, w) 
whenever (v,w) € V x W C R(V x W). The fact that A is bilinear means 
precisely that the subspace & is contained in the kernel of A, because 


A(av,w) = A(av, w) 
= aA(v, w) 
= aA(v, w) 
= A(a(v, w)), 


with similar considerations for the other expressions in (8.2). Therefore, 
A descends to a linear map A: V @W = R(V x W)/R = X satisfying 
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Aon = A. Uniqueness follows from the fact that every element of V © W 
can be written as a linear combination of elements of the form v © w, 


and A is uniquely determined on such elements by A(v ® w) = A(v,w) = 
A(v,w). 


The reason this is called the characteristic property is that it uniquely 
characterizes the tensor product up to isomorphism; see Problem 8-1. 


Proposition 8.4 (Other Properties of Tensor Products). Let V, 
W, and X be finite-dimensional real vector spaces. 


(a) The tensor product V* @ W* is canonically isomorphic to the space 
B(V,W) of bilinear maps from V x W into R. 


(b) If (Ej) is a basis for V and (F;) is a basis for W, then the set of all 
elements of the form EH; ® Fj is a basis for V @ W, which therefore 
has dimension equal to (dim V)(dim W). 


(c) There is a unique isomorphism V @(W@X) > (V@W)@X sending 
v@(w@z) to (v@w) @z. 


Proof. The canonical isomorphism between V* ® W* and B(V,W) is con- 
structed as follows. First, define a map ®: V* x W* — B(V,W) by 


O(w, 1) (v, w) = w(v)n(w). 


It is easy to check that ® is bilinear, so by the characteristic property it 
descends uniquely to a linear map ®: V* @ W* > B(V,W). 

To see that ® is an isomorphism, we will construct an inverse for it. Let 
(£;) and (F;) be any bases for V and W, respectively, with dual bases 
(ct) and (y’). Since V* @ W* is spanned by elements of the form w @ 7 
for w € V* and 7 € W%*, every tT € V* © W* can be written in the form 
T = Tij€' @ vy’. (We are not claiming yet that this expression is unique.) 

Define a map UV: B(V,W) — V* @ W* by setting 


U(b) = (Ex, Fije® @ ol. 
We will show that W and © are inverses. First, for T = 7ije"@y? € V*@W*, 
Wo &(r) = (7)(Ex, Fe* @¢! 
O(c’ @ )(Ex, Fe* @ g! 
= Tip P(E", y')(Ex, Fije* ® y! 
= Ti€"(Ex)y' (Fije® @ ¢! 


= Tigé & y? 


=T. 
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On the other hand, for b€ B(V,W), v€ V, and w € W, 
B 0 U(b)(v, w) = O(b(Ex, Fie* & ¢')(v, w) 
= (Ex, Fi) ®(e* @ g')(v, w) 


= b(Ex, Fije*(v)y'(w) 
(Ex, Fi)v k w 
( 


e 
= 


Thus ¥ = 6-1. (Note that although we used bases to prove that © is 
invertible, ® itself is canonically defined without reference to any basis.) 

We have already observed above that the elements of the form e* ® y/ 
span V* @ W*. On the other hand, it is easy to check that dim B(V,W) = 
(dim V)(dim W) (because any bilinear form is uniquely determined by its 
action on pairs of basis elements), so for dimensional reasons the set {e’ ® 
yp} is a basis for V* @ W*. 

Finally, the isomorphism between V @ (W @ X) and (V @ W) ® X is 
constructed as follows. For each x € X, the mapa,: VxW - V@(We@X) 
defined by 


az(v,w) =Vv®(w @z2) 


is obviously bilinear, and thus by the characteristic property of the tensor 
product it descends uniquely to a linear map @,: V @W —~-V@(W@X) 
satisfying @,(v ® w) = v®(w®@ 2). Similarly, the map 3: (V@W) x X > 
V @(W ® X) given by 


G(r, x) = A(T) 
determines a linear map 3: (V @W) @X > V @ (W @X) satisfying 
B((v @ w) @ x) =v (wa). 


Because V ® (W @ X) is spanned by elements of the form v @ (w @ 2), 
G is clearly surjective, and therefore it is an isomorphism for dimensional 
reasons. It is clearly the unique such isomorphism, because any other would 
have to agree with @ on the set of elements of the form (v ® w) ® x, which 
spans (V @W) @X. 


The next corollary explains the relationship between this abstract tensor 
product of vector spaces and the more concrete covariant k-tensors we 
defined earlier. 


Corollary 8.5. If V is a finite-dimensional real vector space, the space 
T*(V) of covariant k-tensors on V is canonically isomorphic to the k-fold 
tensor product V* ®---@V*. 
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Exercise 8.3. Prove Corollary 8.5. 


Using these results, we can generalize the notion of covariant tensors on 
a vector space as follows. For any finite-dimensional real vector space V, 
define the space of contravariant tensors of rank k to be 


Ty(V) =V@---@V. 


k copies 


Because of the canonical identification V = V** and Corollary 8.5, an 
element of T;,(V) can be canonically identified with a multilinear function 
from V* x --- x V* into R. In particular, T;(V) = V** & V, the space of 
“contravariant vectors.” 

More generally, for any k,l € N, the space of mixed tensors on V of type 
(*) is defined as 


T(V) =V*@---@V*@V®::-@V. 
a ee, ee” 


k copies l copies 


From the discussion above, T*(V) can be identified with the set of real- 
valued multilinear functions of k vectors and | covectors. 

In this book, we will be concerned primarily with covariant tensors, which 
we will think of primarily as multilinear functions of vectors, in keeping with 
our original definition. Thus tensors will always be understood to be covari- 
ant unless we explicitly specify otherwise. However, it is important to be 
aware that contravariant and mixed tensors play an important role in more 
advanced parts of differential geometry, especially Riemannian geometry. 


Tensors and Tensor Fields on Manifolds 


Now let M be a smooth manifold. We define the bundle of covariant k- 
tensors on M by 


T*M = || T*(TM). 
pEeM 


Similarly, we define the bundle of contravariant l-tensors by 


TM = [| 1@,™), 
pEeM 


and the bundle of mixed tensors of type (*) by 


TPM = JI TK(TpM). 
pEeM 
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Clearly there are natural identifications 


TM =TM=M xR, 


T'M =T"*M, 
TM =TM, 
TyM=T*M, 
T2M =TM. 


Exercise 8.4. Show that T*M, T;M, and TM have natural structures 
as smooth vector bundles over M, and determine their ranks. 


Any one of these bundles is called a tensor bundle over M. (Thus the tan- 
gent and cotangent bundles are special cases of tensor bundles.) A section 
of a tensor bundle is called a (covariant, contravariant, or mixed) tensor 
field on M. A smooth tensor field is a section that is smooth in the usual 
sense of smooth sections of vector bundles. We denote the vector spaces of 
smooth sections of these bundles by 


J*(M) = {smooth sections of T*M}; 
J,;(M) = {smooth sections of T;M}; 
Th(M 


a 
In any local coordinates (a'), sections of these bundles can be written (using 
the summation convention) as 


= {smooth sections of TM}. 


Oiy...i, AZ @-+- @ dz"; 7 T*(M), 
we OD O 
—_ aaron B Oa Ti 0 €T,(M), 


Fee OSC OS pe TM) 


Ji-Il 
Seabig 


The functions 94, ..i,, ght, or 0%, 
of o in these coordinates. 


are called the component functions 


Lemma 8.6. Let M be a smooth manifold, and let o: M > T*M be a 
map (not assumed to be continuous) such that o, € T*(T,M) for each 
pe M. The following are equivalent. 


(a) o is smooth. 
(b 
(c 


) In any coordinate chart, the component functions of o are smooth. 
) If X1,...,Xz are vector fields defined on any open subset U C M, 
then the function o(X1,...,Xz): U > R, defined by 


o(X1,...,Xk)(p) = op(X1|p,---, Xelp), 


is smooth. 
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Exercise 8.5. Prove Lemma 8.6. 


Exercise 8.6. Formulate and prove smoothness criteria analogous to 
those of Lemma 8.6 for contravariant and mixed tensor fields. 


Smooth covariant 1-tensor fields are just covector fields. Recalling that a 
0-tensor is just a real number, a 0-tensor field is the same as a real-valued 
function. 


Lemma 8.7. Let M be a smooth manifold, and suppose a € T*(M), 7 € 
T'(M), and f € C~(M). Then fo anda ®T are also smooth tensor fields, 
whose components in any local coordinate chart are 


(FO) feo = fOix..ins 


(7 ® (seer = Cig. ig Tinggi ing: 


Exercise 8.7. Prove Lemma 8.7. 


Pullbacks 


Just like smooth covector fields, smooth covariant tensor fields can be pulled 
back by smooth maps to yield smooth tensor fields. 

If F: M — N is a smooth map and a is a smooth covariant k-tensor 
field on N, we define a k-tensor field F*o on M, called the pullback of o, 
by 


(F*o)p(X1, a ., Xx) = Or(p) (Fe X1, eee , FX). 


Proposition 8.8. Suppose F: M — N and G: N — P are smooth maps, 
cE T*(N), TET (N), and fe C™*(N). 


F* is linear over R. 


(a) 
(b) 
(c) F*(o @7T) = F*o ® F*r. 
(d) (Go F)* = F* oG*. 

(e) 


Exercise 8.8. Prove Proposition 8.8. 


If f is asmooth function (i.e., a 0-tensor field) and o is a smooth k-tensor 
field, then it is consistent with our definitions to interpret f®@o as fo, and 
F*f as fo F. With these interpretations, property (b) of this proposition 
is really just a special case of (c). 

Observe that properties (d) and (e) imply that the assignments M + 
T*M and F + F* yield a contravariant functor from the category of 
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smooth manifolds to itself. Because of this, the convention of calling ele- 
ments of T*M covariant tensors is particularly unfortunate; but this ter- 
minology is so deeply entrenched that one has no choice but to go along 
with it. 

The following corollary is an immediate consequence of Proposition 8.8. 


Corollary 8.9. Let F: M — N be smooth, and let co € T*(N). Ifp¢ M 
and (y/) are coordinates for N on a neighborhood of F(p), then F*o has 
the following expression near p: 


F*(05,...;,dy"! @ --- @ dy*) = (0},...5, 0 F)d(y”! 0 F) @---@ d(y* 0 F). 
Therefore F*a is smooth. 


In words, this corollary just says that F'*o is computed by the same tech- 
nique we described in Chapter 4 for computing the pullback of a covector 
field: Wherever you see y/ in the expression for o, just substitute the jth 
component function of F and expand. We will see examples of this in the 
next section. 


Symmetric Tensors 


Symmetric tensors—those whose values are unchanged by rearranging their 
arguments—play an extremely important role in differential geometry. We 
will describe only covariant symmetric tensors, but similar considerations 
apply to contravariant ones. 

It is useful to start, as usual, in the linear algebraic setting. Let V be a 
finite-dimensional vector space. A covariant k-tensor T on V is said to be 
symmetric if its value is unchanged by interchanging any pair of arguments: 


P( XG ove Miporg Xyge gy Xe) SL (Ayes ty Xap Meet Xe) 
whenever 1 < 7,7 <k. 


Exercise 8.9. Show that the following are equivalent for a covariant k- 
tensor T: 


(a) T is symmetric. 
(b) For any vectors X1,...,Xx € V, the value of T(Xi,...,X%) is un- 


changed when Xj,..., X, are rearranged in any order. 


(c) The components Ti,...4, of T with respect to any basis are unchanged 
by any permutation of the indices. 


We denote the set of symmetric covariant k-tensors on V by D*(V). 
It is obviously a vector subspace of T*(V). There is a natural projection 
Sym: T*(V) — X*(V) called symmetrization, defined as follows. First, 
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let S; denote the symmetric group on k elements, that is, the group of 
permutations of {1,...,&}. Given a k-tensor T and a permutation o € Sx, 
we define a new k-tensor T” by 


Te (Kage Sse Xo), 
Then we define Sym T by 
Sym T = : Se T°. 
o€S, 

Lemma 8.10 (Properties of Symmetrization). 

(a) For any covariant tensor T, SymT is symmetric. 

(b) T is symmetric if and only if SymT = T. 
Proof. Suppose T € T*(V). If 7 € S; is any permutation, then 


1 
(Sym TP) (Xs Meg) =a, FOG Ee) 


! 
ki oES, 
1 
= kl S- PAG Reiki) 
. oeS, 
1 
=< Oe PUR cdg KRY 
* n€Sk 


I 


(Sym T)(X1,..., Xx), 


where we have substituted 7 = o7 in the second-to-last line and used 
the fact that 7 runs over all of S; as a does. This shows that Sym T’ is 
symmetric. 


If T is symmetric, then Exercise 8.9 shows that T° = T for every a € Sx, 
so it follows immediately that Sym T = T. On the other hand, if Sym T = 
T, then T is symmetric because part (a) shows that Sym T is. 


If S and T are symmetric tensors on V, then S$ ® T is not symmetric 
in general. However, using the symmetrization operator, it is possible to 
define a new product that takes symmetric tensors to symmetric tensors. 
If S € U*(V) and T € S!(V), we define their symmetric product to be the 
(k + l)-tensor ST (denoted by juxtaposition with no intervening product 
symbol) given by 


ST = Sym(S ® T). 
More explicitly, the action of ST’ on vectors X1,..., X47 is given by 


ST(X1, figs ,XE41) 


1 
= eT Se, BOG ap Xo) E Motetiivesis Madea): 


oESk41 
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Proposition 8.11 (Properties of the Symmetric Product). 


(a) The symmetric product is symmetric and bilinear: For all symmetric 
tensors R,S,T and all a,b € R, 


ST =TS, 
(aR + bS)T = aRT +bST =T(aR + dS) 


(b) Ifw and 7 are covectors, then 
wn = 5 O@n+n@w). 
Exercise 8.10. Prove Proposition 8.11. 


A symmetric tensor field on a manifold is simply a covariant tensor field 
whose value at any point is a symmetric tensor. The symmetric product of 
two or more tensor fields is defined pointwise, just like the tensor product. 


Riemannian Metrics 


The most important examples of symmetric tensors on a vector space are 
inner products (see the Appendix). Any inner product allows us to define 
lengths of vectors and angles between them, and thus to do Euclidean 
geometry. 

Transferring these ideas to manifolds, we obtain one of the most impor- 
tant applications of tensors to differential geometry. Let M be a smooth 
manifold. A Riemannian metric on M is a smooth symmetric 2-tensor 
field that is positive definite at each point. A Riemannian manifold is a 
pair (M,g), where M is a smooth manifold and g is a Riemannian metric 
on M. One sometimes simply says “M is a Riemannian manifold” if M is 
understood to be endowed with a specific Riemannian metric. 

Note that a Riemannian metric is not the same thing as a metric in the 
sense of metric spaces, although the two concepts are closely related, as 
we will see below. Because of this ambiguity, we will usually use the term 
“distance function” when considering a metric in the metric space sense, 
and reserve “metric” for a Riemannian metric. In any event, which type of 
metric is being considered should always be clear from the context. 

If g is a Riemannian metric on M, then for each p € M, gp is an inner 
product on T,M. Because of this, we will often use the notation (X,Y), 
to denote the real number g,(X,Y) for X,Y € T,M. 


In any local coordinates (x’), a Riemannian metric can be written 


g = gijdx' @ dx’, 
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where g;; is a symmetric positive definite matrix of smooth functions. Ob- 
serve that the symmetry of g allows us to write g also in terms of symmetric 
products as follows: 


g= gigdx" ® dxi 
= $(gijdx' @ dx! + gjdx' ® dx!) (since gi; = gji) 


= $(gijdx' ® dx! + gijdx’ ® dx") (switch i j in the second term) 


I 


gijdx' dx! (definition of symmetric product). 


Example 8.12. The simplest example of a Riemannian metric is the 
Euclidean metric g on R”, defined in standard coordinates by 


g= 6;;dx' dx). 


It is common to use the abbreviation w? for the symmetric product of a 
tensor w with itself, so the Euclidean metric can also be written 


g = (det)? +--+ + (da)? 
Applied to vectors v,w € T,R”, this yields 


n 
G,(v, w) = dijv'w! = ) vw =v-w. 


i=l 


In other words, g is the 2-tensor field whose value at each point is the 
Euclidean dot product. (As you may recall, we warned in Chapter 1 that 
expressions involving the Euclidean dot product are likely to violate our 
index conventions and therefore to require explicit summation signs. This 
can usually be avoided by writing the metric coefficients 6;; explicitly, as 
in uy v'wi .) 


To transform a Riemannian metric under a change of coordinates, we use 
the same technique as we used for covector fields: Think of the change of 
coordinates as the identity map expressed in terms of different coordinates 
for the domain and range, and use the formula of Corollary 8.9. As before, 
in practice this just amounts to substituting the formulas for one set of 
coordinates in terms of the other. 


Example 8.13. To illustrate, let us compute the coordinate expression 
for the Euclidean metric on R? in polar coordinates. The Euclidean metric 
is g = dx? + dy’. (By convention, the notation dx? means the symmetric 
product dxdz, not d(x?)). Substituting « = rcos@ and y = rsin@ and 
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expanding, we obtain 


= d(rcos 6)? + d(rsin 0)? 
= (cos @ dr — rsin 6 d6)? + (sin 6 dr + r cos 6 dé)? 


(8.4) 
= (cos? 6 + sin? 6)dr? + (r? sin? 6 + r? cos? 0)d6? 


+ (—2rcos@ sin + 2rsin 6 cos 0)dr d0 
= dr? + r? dé. 
Below are just a few of the geometric constructions that can be defined 
on a Riemannian manifold (M, g). 


e The length or norm of a tangent vector X € T,M is defined to be 
|X]q = (X, XG? = gp(X, X)7?. 


e The angle between two nonzero tangent vectors X,Y € T,M is the 
unique 6 € [0,7] satisfying 


(X, Y)9 


cos§ = ——_=.. 
IX|9 IY |g 


e Two tangent vectors X,Y € T,M are said to be orthogonal if 
(X,Y) =0. 


e If y: [a,b] — M is a piecewise smooth curve segment, the length of + 
is 


b 
Lo) = f by’ Oleat 


Because |7(t)|g is continuous at all but finitely many values of t, the 
integral is well-defined. 


Exercise 8.11. If: [a,b] ~ M is a piecewise smooth curve segment and 
a<c<, show that 


Lg(y) = Lg (Yfo,q) + Lo (YI fe,0)) - 


It is an extremely important fact that length is independent of parametri- 
zation in the following sense. In chapter 4, we defined a reparametrization 
of a smooth curve segment y: [a,b] — M to be acurve segment of the form 
Y= 704, where y: [c,d] — [a,b] is a diffeomorphism. More generally, if 
7 is piecewise smooth, we allow y to be a homeomorphism whose restric- 
tion to each subinterval [c;_1, c;] is a diffeomorphism onto its image, where 
C=C <1 <-+: < cy =d is some finite subdivision of [c, d]. 
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Proposition 8.14 (Parameter Independence of Length). Let 
(M,g) be a Riemannian manifold, and let y: [a,b] > M be a piece- 
wise smooth curve segment. If Y is any reparametrization of y, then 
Lg(7) = Lg). 


Proof. First suppose that 7 is smooth, and ¢: [c, d] > [a, ] is a diffeomor- 
phism such that y = yo y. The fact that ¢ is a diffeomorphism implies 
that either y’ > 0 or y’ < 0 everywhere. Let us assume first that y’ > 0. 
We have 


'(s)Iq ds 
= L(y); 


where the second-to-last equality follows from the change of variables for- 
mula for ordinary integrals. 

In case y’ < 0, we just need to introduce two sign changes into the 
above calculation. The sign changes once when y’(t) is moved outside the 
absolute value signs, because |y’(t)| = —y’(t). Then it changes again in the 
last step, because y reverses the direction of the integral. Since the two 
sign changes cancel each other, the result is the same. 

If y and y are only piecewise smooth, we can subdivide [c, d] into finitely 
many subintervals on which both ¥ and y are smooth, and then the result 
follows by applying the above argument on each such subinterval. 


Suppose (M,g) and (M,g) are Riemannian manifolds. A smooth map 
F: M — M is called an isometry if it is a diffeomorphism that satisfies 
F*g = g. If there exists an isometry between M and M, we say that M 
and M are isometric as Riemannian manifolds. More generally, F’ is called 
a local isometry if every point p € M has a neighborhood U such that F\y 
is an isometry of U onto an open subset of M. A metric g on M is said to 
be flat if every point p € M has a neighborhood U C M such that (U, g|v) 
is isometric to an open subset of R” with the Euclidean metric. 

Riemannian geometry is the study of properties of Riemannian mani- 
folds that are invariant under isometries. See, for example, [Lee97| for an 
introduction to some of its main ideas and techniques. 
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Exercise 8.12. Show that lengths of curves are isometry invariants of Rie- 
mannian manifolds. More precisely, suppose (MM, g) and (M, g) are Riemann- 
ian manifolds, and F: M — M is an isometry. Show that Lg(Foy) = Ly(7) 
for any piecewise smooth curve segment y in M. 


Another extremely useful tool on Riemannian manifolds is orthonormal 
frames. Let (M, g) be an n-dimensional Riemannian manifold. A local frame 
(F,...,£,) for M defined on some open subset U C M is said to be 
orthonormal if (Ei|p,...,En|p) is an orthonormal basis for T,M at each 
point p € U, or in other words if (E;, Ej), = 645. 


Example 8.15. The coordinate frame (0/0x") is a global orthonormal 
frame on R”. 


Proposition 8.16 (Existence of Orthonormal Frames). Let (M,q) 
be a Riemannian manifold. For any p © M, there is a smooth orthonormal 
frame on a neighborhood of p. 


Proof. Let (x*) be any coordinates on a neighborhood U of p. Applying 
the Gram-Schmidt algorithm (Proposition A.17) to the coordinate frame 
(0/0x"), we obtain a new frame (E;), given inductively by the formula 


0/dxi — SI} (Ej, Ei) gE: 


7 [8/005 — CEE; BoB], 


Because span(Fi,...,£;-1) = span(0/0z!,...,0/0x7—'), the vector 
whose norm appears in the demoninator above is nowhere zero on U. Thus 
this formula defines E£; as a smooth vector field on U, and a computation 
shows that the resulting frame (£;) is orthonormal. 


Observe that Proposition 8.16 did not show that there are coordinates 
near p for which the coordinate frame is orthonormal. Problem 8-13 shows 
that there are such coordinates in a neighborhood of each point only if the 
metric is flat. 


The Riemannian Distance Function 


Using curve segments as “measuring tapes,” we can define a notion of 
distance between points on a Riemannian manifold. If (M, g) is a connected 
Riemannian manifold and p,q € M, the (Riemannian) distance between 
p and q, denoted by d,(p,q), is defined to be the infimum of L,(y) over 
all piecewise smooth curve segments y from p to g. Because any pair of 
points in a connected manifold can be joined by a piecewise smooth curve 
segment (Lemma 4.16), this is well-defined. 
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Example 8.17. On R” with the Euclidean metric g, one can show that 
any straight line segment is the shortest piecewise smooth curve segment 
between its endpoints (Problem 8-14). Therefore, the distance function dg 
is equal to the usual Euclidean distance: 


dg(x,y) = |a — yl. 


Exercise 8.13. If (M/,g) and (Mm ,g) are connected Riemannian manifolds 
and F: M — M is an isometry, show that dz(F'(p), F(¢)) = dg(p,¢) for all 
pqeM. 


We will see below that the Riemannian distance function turns M into 
a metric space whose topology is the same as the given manifold topology. 
The key is the following technical lemma, which shows that any Riemannian 
metric is locally comparable to the Euclidean metric in coordinates. 


Lemma 8.18. Let g be any Riemannian metric on an open set U C R”. 
For any compact subset K C U, there exist positive constants c,C such 
that for alla € K and allv€T,M, 


clulg < |ulg < Clouly. (8.5) 
Proof. For any compact subset Ik C U, let L C TR” be the set 
L={(a,v) € TR": re K, |vlg = 1}. 


Since DL is a product of compact sets in TR” = R” x R”, L is compact. 
Because the norm |v|, is continuous and strictly positive on L, there are 
positive constants c, C such that c < |v|g < C whenever (z,v) € L.Ifc«ek 
and v is any nonzero vector in TR”, let \ = |v|z. Then (x,A7~'v) € L, so 
by homogeneity of the norm, 


lulg = AA] SAC = Clog. 


A similar computation shows that |v|, > clvlz. The same inequalities are 
trivially true when v = 0. 


Proposition 8.19 (Riemannian Manifolds as Metric Spaces). Let 
(M, g) be a connected Riemannian manifold. With the Riemannian distance 
function, M is a metric space whose metric topology is the same as the 
original manifold topology. 


Proof. It is immediate from the definition that d,(p,q) > 0 for any p,q € 
M. Because any constant curve segment has length zero, it follows that 
d,(p,p) = 0, and d,(p,q) = d,(q,p) follows from the fact that any curve 
segment from p to q can be reparametrized to go from q to p. Suppose 7; and 
72 are piecewise smooth curve segments from p to q and q to r, respectively, 
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and let y be a piecewise smooth curve segment that first follows 7, and 
then follows yz (reparametrized if necessary). Then 


dg(p,r) < Lg(y) = Lg(y1) + Lg(72). 


Taking the infimum over all such 7; and 72, we find that d,(p,r) < dg(p, q)+ 
d,(q,r). (This is one reason why it is important to define the distance 
function using piecewise smooth curves instead of just smooth ones.) 

To complete the proof that (M,d,) is a metric space, we need only show 
that d,(p,q) > 0 if p # q. For this purpose, let p,q € M be distinct 
points, and let U be any coordinate domain containing p but not q. Use 
the coordinate map as usual to identify U with an open subset in R”, and 
let g denote the Euclidean metric in these coordinates. If V is a coordinate 
ball of radius ¢ centered at p such that V C U, Lemma 8.18 shows that 
there are positive constants c,C such that 


Xlg < |X1g < ClXI5 


whenever gq € V and X € T,M. Then for any piecewise smooth curve 
segment ¥ lying entirely in V, it follows that 


cLg(y) < Lg(y) < CLy(y). 


Suppose 7: [a,b] + M is a piecewise smooth curve segment from p to q. 
Let to be the infimum of all t € [a,b] such that y(t) ¢ V. It follows that 


(to) € OV by continuity, and y(t) € V for a <t < to. Thus 


L(y) = Lg (Vfa,to]) = Lg (Vfa,to]) = cdg(p, Y(to)) = ce. 


Taking the infimum over all such y, we conclude that d,(p, q) > ce > 0. 

Finally, to show that the metric topology generated by dg is the same as 
the given manifold topology on M, we will show that the open sets in the 
manifold topology are open in the metric topology and vice versa. Suppose 
first that U C M is open in the manifold topology. Let p be any point of 
U, and let V be a coordinate ball of radius ¢ around p such that V C U as 
above. The argument in the previous paragraph shows that d,(p,q) > ce 
whenever g ¢ V. The contrapositive of this statement is that d,(p,q) < ce 
implies g € V C U, or in other words the metric ball of radius ce around p 
is contained in U. This shows that U is open in the metric topology. 

Conversely, suppose that W is open in the metric topology, and let p € 
W. Let V be any closed coordinate ball around p, let g be the Euclidean 
metric on V determined by the given coordinates, and let c,C be positive 
constants such that (8.5) is satisfied for X € T,M, q € V. For any « > 0, 
let V- be the set of points whose Euclidean distance from p is less than 
e. If q € Vz, let y be the straight-line segment in coordinates from p to q. 
Arguing as above, (8.5) implies 


dg(p,q) < Lg(y) < CLg(y) = Ce. 
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If we choose € small enough that the closed metric ball of radius Ce around 
p is contained in W, this shows that V. C W. Since Vz is a neighborhood 
of p in the manifold topology, this shows that W is open in the manifold 
topology as well. 


A topological space is said to be metrizable if it admits a distance function 
whose metric topology is the same as the given topology. The next corollary 
is an immediate consequence of the preceding proposition. 


Corollary 8.20. Every smooth manifold is metrizable. 


Riemannian Submanifolds 


If (M, g) isa Riemannian manifold and S Cc M is an immersed submanifold, 
we can define a smooth symmetric 2-tensor g|s on S by g|s = t*g, where 
t: S — M is the inclusion map. By definition, this means for X,Y € T,S 


(gls)(X, xX) a u“g(X, Y) g(taX, txY) = G(X, Y); 


so g|g is just the restriction of g to vectors tangent to S. Since the restric- 
tion of an inner product to a subspace is still positive definite, g|g is a 
Riemannian metric on S, called the induced metric. In this case, S' is called 
a Riemannian submanifold of M. 


Example 8.21. The metric g = g|sn induced on S” from the Euclidean 
metric by the usual inclusion S” — R”*! is called the round metric on the 
sphere. 


If S is a Riemannian submanifold of (M, g), it is usually easiest to com- 
pute the induced metric g|s in terms of a local parametrization of S, which 
is a smooth embedding X: U — M whose image is an open subset of S. 
The coordinate representation of g|s with respect to the coordinate chart 
y = X~! is then the pullback metric X*g. The next two examples will 
illustrate the procedure. 


Example 8.22 (Riemannian Metrics in Graph Coordinates). Let 
U Cc R” be an open set, and let M C R”*! be the graph of the 
smooth function f: U — R. Then the map X: U — R"*! given by 
X(u',...,u") = (ul,...,u”, f(u)) is a (global) parametrization of M, and 
the induced metric on M is given in graph coordinates by 
X*g = X*((dz')? +--+ + (dxz"t")?) 
= (du+)? +----+ (du)? + df”. 


Example 8.23. Let D C R® be the embedded torus obtained by revolving 
the circle (y — 2)? + 2? = 1 around the z-axis. If X: R? — R®° is the map 


X(y, 6) = ((2 + cos vy) cos 8, (2 + cosy) sin @, sin y), 
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then the restriction of X to any sufficiently small open set U C R? is a local 
parametrization of D. The metric induced on D by the Euclidean metric 
is computed as follows: 
X*G = X* (da? + dy + dz”) 
= d((2 + cosy) cos 6)? + d((2 + cosy) sin 6)? + d(sin y)? 
= (—sin pcos 6 dt — (2 + cosy) sin 0 d0)? 
+ (—sin ysin @ dt + (2 + cos y) cos 6 dé)? 
+ (cos y dt)? 
(sin? y cos? 6 + sin? ysin? 6 + cos? y)dy? 


I 


cos y) sin y cos # sin 6 — (2 + cosy) sin y cos O sin 0) dip 
2 i Osin 6 — (2 i @sin 6)dy dé 
((2 + cos y)* sin? 6 + (2 + cos y)? cos? 6)d6” 

= dp? + (2+ cos y)*d0”. 


If (M,g) is an n-dimensional Riemannian manifold and S Cc M 
is a k-dimensional Riemannian submanifold, a local orthonormal frame 
(F\,...,£,) for M on an open set U C M is said to be adapted to S if 
(E4|p,---,Ex|p) is an orthonormal basis for T,S at each pe UNS. 


Proposition 8.24 (Existence of Adapted Orthonormal Frames). 
Let S C M be an embedded Riemannian submanifold of the Riemannian 
manifold (M,g). For each p € S, there is an adapted orthonormal frame 
on a neighborhood U of p in M. 


Proof. Let (x1,...,2") be slice coordinates for S on a neighborhood U 
of p, so that SMU is the set where e*+! = .-- = x” = 0. Applying 
the Gram-Schmidt algorithm to the frame (0/0z'), we obtain an ortho- 
normal frame (£,...,£,) with the property that span(£1|p,...,Exl|p) = 
span(0/dz1|,,...,0/Ox"|)) = TpS at each p € S. 


The Tangent-Cotangent Isomorphism 


Another very important feature of Riemannian metrics is that they provide 
a natural correspondence between tangent and cotangent vectors. Given a 
Riemannian metric g on a manifold M, define a bundle map g: TM — 
T*M by 


GX)(Y) =9p(X,Y) for X,Y €T,M. 


(Recall that a bundle map is a smooth map whose restriction to each fiber 
is a linear map from T,M to TM.) 


Exercise 8.14. Show that g is a bundle map. 
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Note that g is injective, because 9(X) = 0 implies 0 = g(X)(X) = 
9p(X, X), which in turn implies X = 0. For dimensional reasons, therefore, 
g is bijective, and so it is a bundle isomorphism (see Problem 5-12). 

In coordinates, 


IXY) = gis(P)X'Y!, 
which implies that the covector g(X) has the coordinate expression 
GX) = gij(v) X*dy’. 


In other words, the restriction of g to T,,M is the linear map whose matrix 
with respect to the coordinate bases for T,M and T; M is just the same as 
the matrix of g. 

It is customary to denote the components of the covector g(X) by 


Xj = gij(p)X*, 
so that 
G(X) = Xjdy’. 


Because of this, one says that g(X) is obtained from X by lowering an 
index. The notation X° is frequently used for g(X), because the symbol > 
(“flat”) is used in musical notation to indicate that a tone is to be lowered. 
Similarly, the inverse map g~!: T pM — T,M is represented by the inverse 
of the matrix (g;;). The components of this inverse matrix are usually 
denoted by g’’, so that 


9 95k = Gj GF” = Sk. 
Thus for a cotangent vector € € TM, g~'(&) has the coordinate represen- 
tation 
ge) <8 2 here ¢¢ = gi (p)é; 
g (S)=S' 5G, where €* = 9" (p)é). 


We use the notation €# (“€-sharp”) for g~1(€), and say that €* is obtained 
from € by raising an index. 

The most important use of the sharp operation is to recover the notion 
of the gradient as a vector field on Riemannian manifolds. For any smooth 
function f on a Riemannian manifold (M, g), we define a vector field grad f, 
called the gradient of f, by 


grad f = (df)* =g"* (df). 
Unraveling the definitions, for any X € T,,M, it satisfies 


(grad fp, X)g = g(grad flp)(X) = dfp(X) = Xf. 
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Thus grad f is the unique vector field that satisfies 
(grad f,X), = Xf for every vector field X, 


or equivalently, 


(grad f, +)y = df. 
In coordinates, grad f has the expression 


Of 
Ox* OxI 


grad f=g 
In particular, on R” with the Euclidean metric, this is just 


of 0 of a 


Ox? Oxi — + Oa? Oa?” 


grad f = 64 


Thus our new definition of the gradient in this case coincides with the gra- 
dient from elementary calculus, which is the vector field whose components 
are the partial derivatives of f. In other coordinates, however, the gradient 
will not generally have the same form. 


Example 8.25. Let us compute the gradient of a function f € C®(R?) 
in polar coordinates. From (8.4), we see that the matrix of g in polar 
coordinates is (j°), so its inverse matrix is (9 ,/,2). Inserting this into 
the formula for the gradient, we obtain 


Of 0 nN 1 Of O 
Or Orr? 00 00° 


grad f = 


Existence of Riemannian Metrics 
We end this section by proving the following important result. 


Proposition 8.26 (Existence of Riemannian Metrics). Every 
smooth manifold admits a Riemannian metric. 


Proof. We give two proofs. For the first, we begin by covering M by coor- 
dinate charts (U,, Ya). In each coordinate domain, there is a Riemannian 
metric gq given by the Euclidean metric 6;;dx‘dx’ in coordinates. Now let 
{wa} be a partition of unity subordinate to the cover {U,}, and define 


I= So Yaga: 


Because of the local finiteness condition for partitions of unity, there are 
only finitely many nonzero terms in a neighborhood of any point, so this 
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expression defines a smooth tensor field. It is obviously symmetric, so only 
positivity needs to be checked. If X € T,,M is any nonzero vector, then 


Ip(X, X) = S balp)galp(X, X). 


This sum is nonnegative, because each term is nonnegative. At least one of 
the functions 7, is strictly positive at p (because they sum to 1). Because 
Jalp(X, X) > 0, it follows that g,(X,X) > 0. 

The second proof is shorter, but relies on the Whitney embedding the- 
orem, which is far less elementary. We simply embed M in R™ for some 
N, and then the Euclidean metric induces a Riemannian metric gli on 
M. 


Pseudo-Riemannian Metrics 


An important generalization of Riemannian metrics is obtained by relaxing 
the requirement that the metric be positive definite. A 2-tensor g on a vector 
space V is said to be nondegenerate if it satisfies any of the following three 
equivalent conditions: 


e g(X,Y) =0 for all Y € V if and only if X =0. 
e The map g: V > V* defined by 9(X)(Y) = g(X,Y) is invertible. 
e The matrix of g with respect to any basis is nonsingular. 


Just as any inner product can be transformed to the Euclidean one by 
switching to an orthonormal basis, every nondegenerate symmetric 2-tensor 
can be transformed by a change of basis to one whose matrix is diagonal 
with all entries equal to +1. The numbers of positive and negative diag- 
onal entries are independent of the choice of basis; thus the signature of 
g, defined as the sequence (—1,...,—1,+1,...,+1) of diagonal entries in 
nondecreasing order, is an invariant of g. 

A pseudo-Riemannian metric on a manifold M is a smooth symmet- 
ric 2-tensor field that is nondegenerate at each point. Pseudo-Riemannian 
metrics whose signature is (—1,+1,...,+1) are called Lorentz metrics, they 
play a central role in physics, where they are used to model gravitation in 
Einstein’s general theory of relativity. 

We will not pursue the subject of pseudo-Riemannian metrics any fur- 
ther, except to note that neither of the proofs above of the existence of 
Riemannian metrics carries over to the pseudo-Riemannian case: in partic- 
ular, it is not always true that the restriction of a nondegenerate 2-tensor to 
a subspace is nondegenerate, nor is it true that a linear combination of non- 
degenerate 2-tensors with positive coefficients is necessarily nondegenerate. 
Indeed, it is not true that every manifold admits a Lorentz metric. 
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Problems 


8-1. 


8-4. 


8-5. 


8-6. 


Let V and W be finite-dimensional real vector spaces. Show that 
the tensor product V @ W is uniquely determined up to canonical 
isomorphism by its characteristic property (Proposition 8.3). More 
precisely, suppose 7: V x W — Z isa bilinear map into a vector space 
Z with following property: For any bilinear map A: VxW — Y, there 
is a unique linear map A: Z — Y such that the following diagram 
commutes: 


vxw—+Ly 
7 A 


Z. 


Then there is a unique isomorphism ®: V@W — Z such that 7 = Bo 
m. [This shows that the details of the construction used to define the 
tensor product are irrelevant, as long as the resulting space satisfies 
the characteristic property. ] 


. If V is any finite-dimensional real vector space, prove that there are 


canonical isomorphisms R@V =V=V OR. 


. Let V and W be finite-dimensional real vector spaces. Prove that 


there is a canonical (basis-independent) isomorphism between V* @W 
and the space Hom(V, W) of linear maps from V to W. 


Let M be a smooth n-manifold, and o a covariant k-tensor field on 
M. If (a*) and (Z?) are overlapping coordinate charts on M, we can 
write 


0 = 0%,..4, dz" @+-- @dx* =o =G;,5,dE" @---@ dz. 


Compute a transformation law analogous to (4.4) expressing the com- 


ponent functions 9;,...;, in terms of 0;,...;,. 


Generalize the change of coordinate formula of Problem 8-4 to mixed 
tensors of any rank. 


Let M be a smooth manifold. 


(a) Given a smooth covariant k-tensor field tT € T*(M), show that 
the map J(M) x --- x T(M) — C®(M) defined by 


(X1,...,X~E) > 7(X1,..., Xe) 
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is multilinear over C™(M), in the sense that for any smooth 
functions f, f’ € C°(M) and smooth vector fields X;, X/, 


Peg PREG ee I) 
Sf My as Mien os KOE Pes Rc 
(b) Show that a map 
T:T(M) x--. x T(M) — C°(M) 


is induced by a smooth tensor field as above if and only if it is 
multilinear over C™(M). 


8-7. Let V be an n-dimensional real vector space. Show that 


aa) _ (n+k—-1)! 


aimE*(V) = ( ‘ = Gee 


8-8. (a) Let T be a covariant k-tensor on a finite-dimensional real vector 
space V. Show that SymT is the unique symmetric k-tensor 
satisfying 


(Sant) (Xo RY) POR eX) 


for all X € V. 


(b) Show that the symmetric product is associative: For all symmet- 
ric tensors R,S,T, 


(RS)T = R(ST). 


(c) If w!,...,w* are covectors, show that 
1 
1 ko > (1) (k) 
wW il S- we) @ +++ @Qw? 
oES: 


8-9. Let g = glgn denote the round metric on the n-sphere, i.e., the metric 


induced from the Euclidean metric by the usual inclusion of S” into 
Rv, 


(a) Derive an expression for g in stereographic coordinates by com- 
puting the pullback (o~')*9. 


(b) In the case n = 2, do the analogous computation in spherical 
coordinates (x, y, z) = (sin y cos 6, sin y sin 0, cos y). 
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8-10. 


8-14. 


8-16. 


8. Tensors 


Let M be any smooth manifold. 


(a) Show that TM and T*M are isomorphic vector bundles. 


(b) Show that the isomorphism of part (a) is not canonical, in the 
following sense: There does not exist a rule that assigns to every 
smooth manifold M a bundle isomorphism Ayy: TM — T*M in 
such a way that for every smooth map F’: M — N, the following 
diagram commutes: 


TM TN 
Aa | |x 
T*M T*N. 


F* 


. Let I’ be a discrete group acting smoothly, freely, and properly on a 


smooth manifold M, and let M = M/T. Show that a Riemannian 
metric g on M is the pullback of a metric on M by the quotient map 
a: M — M if and only if g is invariant under T (ie., y*g = g for 
every y €T). 


. Let (M,g) and (M,g) be Riemannian manifolds. Suppose F: M > 


M is a smooth map such that F*g = g. Show that F is an immersion. 


. Let (M,g) be a Riemannian manifold. Show that the following are 


equivalent: 
(a) Each point of M has a coordinate neighborhood in which the 
coordinate frame is orthonormal. 
(b) g is flat. 
Show that the shortest path between two points in Euclidean space 


is a straight line. More precisely, for x,y € R”, let y: [0,1] — R” be 
the curve segment 


y(t) = (1 — te + ty, 


and show that any other piecewise smooth curve segment ¥ from x 
to y satisfies L(y) > Lz(y). [Hint: First consider the case in which 
both z and y lie on the x!-axis.] 


. Let M = R? \ {0} with the Euclidean metric g, and let p = (1,0), 


q = (—1,0). Show that there is no piecewise smooth curve segment + 
from p to q in M such that Lz(y) = dg(p, ¢). 


Let (1M, g) be a Riemannian manifold, and let f ¢ C@(M). 


8-17. 
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(a) For any p € M, show that among all unit vectors X € T,M, the 
directional derivative X f is greatest when X points in the same 
direction as grad f|,, and the length of grad f|, is equal to the 
value of the directional derivative in that direction. 


(b) If p is a regular point of f, show that grad f|, is orthogonal to 
the level set of f through p. 


Let T? =S! x--- x S' Cc C", and let g be the metric on T” induced 
from the Euclidean metric on C” (identified with R?”). Show that g 
is flat. 


. Let (M, g) be a Riemannian manifold and let S C M bea Riemannian 


submanifold. If p € S, a vector N € TM is said to be normal to S 
if N is orthogonal to T,,S with respect to g. Show that the set of 
all vectors normal to S is a smooth vector bundle over S, called the 
normal bundle to S. [Hint: use adapted orthonormal frames. ] 


. If S C M is an embedded submanifold, a smooth map NV: S —- TM 


such that N, € T,M for each p € S is called a vector field along 
S. If (M,g) is a Riemannian manifold and S$ Cc M is a Riemannian 
submanifold of codimension 1, show that every p € S has a neighbor- 
hood on which there exist exactly two unit-length vector fields along 
S that are normal to S. 
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Differential Forms 


In the previous chapter, we introduced symmetric tensors—those whose val- 
ues are unchanged by interchanging any pair of arguments. In this chapter, 
we explore the complementary notion of alternating tensors, whose values 
change sign whenever two arguments are interchanged. The main focus of 
the chapter is differential forms, which are just alternating tensor fields. 
These innocent-sounding objects play an unexpectedly important role in 
smooth manifold theory, through two applications. First, as we will see in 
Chapter 10, they are the objects that can be integrated in a coordinate- 
independent way over manifolds or submanifolds; second, as we explore in 
Chapter 11, they provide a link between analysis and topology by way of 
the de Rham theorem. 

We begin the chapter with a heuristic discussion of the measurement of 
volume, to motivate the central role played by alternating tensors. We then 
proceed to study the algebra of alternating tensors. The most important 
algebraic construction is a product operation called the wedge product, 
which takes alternating tensors to alternating tensors. Then we transfer 
this to manifolds, and introduce the exterior derivative, which is a natural 
differential operator on differential forms. 

At the end of the chapter, we introduce symplectic forms, which are a 
particular type of differential form that play an important role in geometry, 
analysis, and mathematical physics. 
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The Heuristics of Volume Measurement 


In Chapter 4, we introduced line integrals of covector fields, which general- 
ize ordinary integrals to curves in manifolds. As we will see in subsequent 
chapters, it is also useful to generalize the theory of multiple integrals to 
manifolds. 

How might we make coordinate-independent sense of multiple integrals? 
First, observe that there is no way to define integrals of functions in a 
coordinate-independent way on a manifold. It is easy to see why, even in 
the simplest possible case: Suppose CC R” is an n-dimensional cube, and 
f: C > Ris the constant function f(x) = 1. Then 


| faV =Vol(C), 
Cc 


which is clearly not invariant under coordinate transformations, even if we 
just restrict attention to linear ones. 

Let us think a bit more geometrically about why covector fields are the 
natural fields to integrate along curves. A covector field assigns a number to 
each tangent vector, in such a way that multiplying the tangent vector by 
a constant has the effect of multiplying the resulting number by the same 
constant. Thus a covector field can be thought of as assigning a “signed 
length meter” to each one-dimensional subspace of the tangent space, and 
it does so in a coordinate-independent way. Computing the line integral of 
a covector field, in effect, assigns a “length” to a curve by using this varying 
measuring scale along the points of the curve. 

Now we wish to seek a kind of “field” that can be integrated in a 
coordinate-independent way over submanifolds of dimension k > 1. Its 
value at each point should be something that we can interpret as a “signed 
volume meter” on k-dimensional subspaces of the tangent space—a ma- 
chine 2 that accepts any k tangent vectors (X1,...,X,) at a point and 
returns a number (0(.Xj,...,X;) that we might think of as the “signed vol- 
ume” of the parallelepiped spanned by those vectors, measured according 
to a scale determined by 2. 

The most obvious example of such a machine is the determinant in R”. 
For example, it is shown in most linear algebra texts that for any two vectors 
X 1, X2 € R?’, det(X1, X2) is, up to a sign, the area of the parallelogram 
spanned by X1, X2. It is not hard to show (see Problem 9-1) that the 
analogous fact is true in all dimensions. The determinant, remember, is an 
example of a tensor. In fact, it is a tensor of a very specific type: It changes 
sign whenever two of its arguments are interchanged. A covariant k-tensor 
T on a finite-dimensional vector space V is said to be alternating if it has 
this property: 


SO CRIED, ORREUN CNS, Ch te” LC. Che mmmD, Corman Cimmene®. 619) 
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X1 


Xo 


cX9 


FIGURE 9.1. Scaling by a constant. FIGURE 9.2. Sum of two vectors. 


Let us consider what properties we might expect a general “signed vol- 
ume meter” 2 to have. To be consistent with our ordinary ideas of volume, 
we would expect that multiplying any one of the vectors by a constant c 
should cause the volume to be scaled by that same constant (Figure 9.1), 
and that the parallelepiped formed by adding together two vectors in the 
ith place results in a volume that is the sum of the volumes of the two 
parallelepipeds with the original vectors in the ith place (Figure 9.2): 


O(X1,...,0XG,.-., Xn) = (NX, ..., Xi,..., Xn), 
BRD Fs as. BND. CoRR, 9) 


Hide iy Ke aes ea 


These two requirements suggest that Q should be multilinear, and thus 
should be a covariant k-tensor. 

There is one more essential property that we should expect: Since n lin- 
early dependent vectors span a parallepiped of zero n-dimensional volume, 
Q should give the value zero whenever it is applied to n linearly dependent 
vectors. As the next lemma shows, this forces 9 to be an alternating tensor. 


Lemma 9.1. Suppose 2 is ak-tensor on a vector space V with the property 
that O(X1,...,X~) = 0 whenever X1,...,X% are linearly dependent. Then 
Q is alternating. 
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Proof. The hypothesis implies, in particular, that 0 gives the value zero 
whenever two of its arguments are the same. This in turn implies 


COM ig 5 ep REO 
ON ia fap ARR tsa Me OS as ag Ree a 
AON caceca Mies city a brane oP OM face cere Snack OG) 


BONN iy Mears GN aaa Meg AE Mig ret Rg og een 


Thus 2 is alternating. 


Because of these considerations, alternating tensor fields are promising 
candidates for objects that can be integrated in a coordinate-independent 
way. We will develop these ideas rigorously in the remainder of this chapter 
and the next; as we do, you should keep this geometric motivation in mind. 


The Algebra of Alternating Tensors 


In this section, we set aside heuristics and start developing the technical 
machinery for working with alternating tensors. For any finite-dimensional 
real vector space V, let A*(V) denote the subspace of T*(V) consisting 
of alternating tensors. [Warning: Some authors use the notation A*(V*) 
in place of A*(V) for this space; see Problem 9-8 for a discussion of the 
reasons why.] An alternating k-tensor is sometimes called a k-covector. 

Recall that for any permutation o € Sx, the sign of 0, denoted by sgno, 
is equal to +1 if o is even (i.e., can be written as a composition of an even 
number of transpositions), and —1 if o is odd. 

The following exercise is an analogue of Exercise 8.9. 


Exercise 9.1. Show that the following are equivalent for a covariant k- 
tensor T: 


(a) T is alternating. 


(b) For any vectors X1,...,X, and any permutation o € Sx, 
T(Xeqy,--+, Xo(k)) = (sgno)T(X1,..., Xx). 
(c) T gives zero whenever two of its arguments are equal: 
DXi ncn Voneoy Voienp dcr) = Ok 


(d) T(X1,...,Xx) = 0 whenever the vectors (X1,...,X;) are linearly in- 
dependent. 


(e) With respect to any basis, the components Tj,...4, of T change sign 
whenever two indices are interchanged. 
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Notice that part (d) implies that there are no nonzero alternating k- 
tensors on V if k > dim V, for then every k-tuple of vectors is dependent. 

Every 0-tensor (which is just a real number) is alternating, because there 
are no arguments to interchange. Similarly, every 1-tensor is alternating. 
An alternating 2-tensor is just a skew-symmetric bilinear form on V. It is 
interesting to note that any 2-tensor T’ can be expressed as the sum of an 
alternating tensor and a symmetric one, because 


T(X,Y) = 3(T(X,Y) —T(¥, X)) + 3(P(XY) + TY, X)) 
= A(X,Y) + 9(X,Y), 

where A(X,Y) = $(T(X,Y) — T(Y,X)) is alternating, and S(X,Y) = 
3(T(X,Y)+T(Y,X)) is symmetric. This is not true for tensors of higher 
rank, as Problem 9-2 shows. 

The tensor S defined above is just Sym 7’, the symmetrization of T de- 
fined in the preceding chapter. We define a similar projection Alt: T*(V) > 
A*(V), called the alternating projection, as follows: 


1 oO 
AltT = A S- (sgno)T”?. 
oES: 
More explicitly, this means 
1 
(AltT)(X1,---, Xe) =F S> (sgno)T(Xo(1)1-+-; Xo(x)): 
o€S. 


Example 9.2. If T is any 1-tensor, then Alt T = T. If T is a 2-tensor, 
then 


Alt T(X,Y) = 5(T(X, Y)—T(Y,X)). 
For a 3-tensor T, 


MT OY A) = AT Ye) OG KG RY) 


AGG F) XZ Y= EZ Ye): 


1 
6 


The next lemma is the analogue of Lemma 8.10. 


Lemma 9.3 (Properties of the Alternating Projection). 
(a) For any tensor T, Alt T is alternating. 
(b) T is alternating if and only if AltT = T. 


Exercise 9.2. Prove Lemma 9.3. 
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Elementary Alternating Tensors 
Let k be a positive integer. An ordered k-tuple I = (i1,...,%,) of positive 
integers is called a multi-index of length k. If J and J are multi-indices such 
that J is obtained from J by a permutation o € Sx, in the sense that 

at = to(1)s ae S| Tk — to(k)> 
then we write J = oI. It is useful to extend the Kronecker delta notation 


in the following way. If J and J are multi-indices of length k, we define 


sgno if neither J nor J has a repeated index 
and J = ol for some a € Sx, 


67 = 
t 0 if J or J has a repeated index 
or J is not a permutation of J. 
Let V be an n-dimensional vector space, and suppose (e!,...,€”) is 


any basis for V*. We will define a collection of alternating tensors on 
V that generalize the determinant function on R”. For each multi-index 
I = (i1,...,%%) of length k such that 1 < 41,...,%% <n, define a covariant 
k-tensor e! by 


e1(X1) ...  e% (Xx) 
e'(X1,...,Xp) = det 
etk(X1) 1... et (X, 
a) se aa 
DO in IO 
= det : : 
KO aos OMS 


In other words, if X denotes the matrix whose columns are the components 
of the vectors X1,...,X, with respect to the basis (E;) dual to (<’), then 
e!(X1,..., Xz) is the determinant of the k x k minor consisting of rows 
i1,...,t,% of X. Because the determinant changes sign whenever two columns 
are interchanged, it is clear that e/ is an alternating k-tensor. We will call 


e an elementary alternating tensor or elementary k-covector. 


For example, in terms of the standard dual basis (e', e?, e?) for (R?)*, 


we have 
BY) Haya Ve, 
eX ¥, X)—det(X,Y,Z). 


Lemma 9.4. Let (E;) be a basis for V, let (e’) be the dual basis for V*, 
and let <! be as defined above. 


(a) If I has a repeated index, then e! = 0. 
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(b) If J=ol for some o € Sx, then ce! = (sena)e’. 


(c) The result of evaluating e’ on a sequence of basis vectors is 


Proof. If I has a repeated index, then for any vectors Xj,...,Xx, the 
determinant in (9.1) has two identical rows and thus is equal to zero, which 
proves (a). On the other hand, if J is obtained from J by interchanging 
two indices, then the corresponding determinants have opposite signs; this 
implies (b). 

To prove (c), we consider several cases. First, if J has a repeated index, 
then e! = 0 by part (a). If J has a repeated index, then e/(Ej,,...,£j,) = 
0 by Exercise 9.1(c). If neither multi-index has any repeated indices but 
J is not a permutation of J, then the determinant in the definition of 
e!(Ej,,...,E;,) has at least one row of zeros, so it is zero. If J = J, 
then e/(Ej,,..., E;,) is the determinant of the identity matrix, which is 1. 
Finally, if J = ol, then 


by part (b). 


The significance of the elementary k-covectors is that they provide a con- 
venient basis for A*(V). Of course, the e/ are not all independent, because 
some of them are zero and the ones corresponding to different permutations 
of the same multi-index are constant multiples of each other. But, as the 
next lemma shows, we can get a basis by restricting attention to an appro- 
priate subset of multi-indices. A multi-index I = (i1,...,i,) is said to be 
increasing if i; < +++ < i,. It will be useful to use a primed summation sign 
to denote a sum over only increasing multi-indices, so that, for example, 


S~ Tye! = S- Tre!. 
I 


{I:1<i1 <++-<ipn<n} 


Lemma 9.5. Let V be an n-dimensional vector space. If (e*) is any basis 
for V*, then the collection of k-covectors 


{e! : I is increasing} 


is a basis for A¥(V). Therefore, 


dim A*(V) = & = Ce 
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Proof. Let (E;) be the basis for V dual to (e*), and let € = {e! 
I is increasing}. We need to show that the set € spans A*(V) and is inde- 
pendent. 

To show that € spans, let T € A*(V) be arbitrary. For each multi-index 
I = (41,...,%%), define a real number T; by 


Ty = ees ao ee emp 
The fact that T is alternating implies that 7; = 0 if J contains a repeated 


multi-index, and Ty = (sgna)Ty if J = oI for o € Sx. For any multi-index 
J, Lemma 9.4 gives 


/ / 
ee) Se SS EB: 
I I 


Therefore, ar Tye! = T, so € spans A*(V). 
To show that € is an independent set, suppose 


S~ Tye! =0 
dT. 


for some coefficients T;. Let J be any increasing multi-index. Applying 
both sides to (£;,,...,£;,) and using Lemma 9.4, 


’ 
C=. Te (Ese, Fy) = T7. 
I 


Thus each coefficient Ty is zero. 


In particular, for an n-dimensional vector space V, this lemma implies 
that A"(V) is 1-dimensional, and is spanned by ¢!-". By definition, this 
elementary n-covector acts on vectors (X1,...,X») by taking the deter- 
minant of the component matrix X = (X 4). For example, on R” with the 
standard basis, e!-” is precisely the determinant function. Since there are 
no increasing multi-indices of length greater than n, the space A*(V) is 
trivial for k > n. 


The Wedge Product 


In Chapter 8, we defined the symmetric product, which takes a pair of 
symmetric tensors S,T and yields another symmetric tensor ST = Sym(S'@ 
T) whose rank is the sum of the ranks of the original ones. 

In this section, we will define a similar product operation for alternat- 
ing tensors. One way to define it would be to mimic what we did in the 
symmetric case and define the product of alternating tensors w and 7 to 
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be Alt(w ® 7). However, we will use a different definition that looks more 
complicated at first but turns out to be much better suited to computation. 

If w € A*(V) and 7 € A'(V), we define the wedge product or exterior 
product of w and 7 to be the alternating (k + 1)-tensor 


wAn= aa Alt(w @ ). (9.2) 


The mysterious coefficient is motivated by the simplicity of the statement 
of the following lemma. 


Lemma 9.6. For any multi-indices I = (i1,...,in) and J = (ji,.--,J1); 
Brine Sees, (9.3) 


where IJ is the multi-index (i1,..-,%k,J1,---, ju) obtained by concatenating 
I and J. 


Proof. By multilinearity, it suffices to show that 


Tong IJ 
ale a CS ee OREN Beat ual © 5 Peete ©) ORS (9.4) 
for any sequence (Ep,,...,£p,,,) of basis vectors. We consider several cases. 


CASE I: P = (pi,...,Pr+i) has a repeated index. In this case, both sides 
of (9.4) are zero by Exercise 9.1(c). 

CASE II: P contains an index that does not appear in either I or J. In 
this case, the right-hand side is zero by Lemma 9.4(c). Similarly, each term 
in the expansion of the left-hand side involves either e! or 7 evaluated on 
a sequence of basis vectors that is not a permutation of J or J, respectively, 
so the left-hand side is also zero. 

CasE III: P= IJ and P has no repeated indices. In this case, the right- 
hand side of (9.4) is equal to 1 by Lemma 9.4(c), so we need to show that 
the left-hand side is also equal to 1. By definition, 


Se (Boi 8 


9°~PR+L 
(k +0)! 
Sa Alt(e? @ e7 (Bas bates Bg) 
1 
= on S- (sgn Oe (Epes whee Peas Je" (Esc pees Epa ay) 
rig ESR 


By Lemma 9.4 again, the only terms in the sum above that give nonzero 
values are those in which o permutes the first k indices and the last /| 
indices of P separately. In other words, 0 must be of the form o = 77, 
where T € S, acts by permuting {1,...,4} and 7 € S; acts by permuting 
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{K+1,...,k +1}. Since sgn(7n) = (sgn) (sgn 7), we have 


EAS (Ey jess Png) 
1 
a kt Ss (sgn T) (sgn me! (Ep, ay MeED ES Ep, (x) Je? eae ates 5 Eon aaa) 
TES 
reg 


1 
(; S- (sgn ne? (Epsiads PS Boyes) 


NES 
= (Alve") (Bi js 4sg Bp, ) (Alte? (CE Ec) 


Pkt1?°**0°~Pk+l 
eH, cvivabyele UE Biyiaz) 
= 1. 


Pk+12°°* 9 Pk+tl 


CASE IV: P is a permutation of IJ. In this case, applying a permutation 
to P brings us back to Case III. Since the effect of the permutation is to 
multiply both sides of (9.4) by the same sign, the result holds in this case 
as well. 


Proposition 9.7 (Properties of the Wedge Product). 
(a) BILINEARITY: 


(aw + a'w") An = a(w An) +a'(w' An), 
nN (aw + a’u’) =a(nAw)t+a'(n Aw’). 


(b) ASSOCIATIVITY: 


wr (nA €)=(wAn) NE. 


(c) ANTICOMMUTATIVITY: For w € A¥(V) andn€ A'(V), 


wAn=(-1)"nAw. 


(d) If (e1,...,€") is any basis for V* and I = (i1,...,i%) is any multi- 
index, 


el Av. A et mel, (9.5) 


(e) For any covectors w!,...,w* and vectors X1,...,Xk, 


wi A+++Aw*(X1,...,Xp) = det(w'(X;)). (9.6) 
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Proof. Bilinearity follows immediately from the definition, because the ten- 
sor product is bilinear and Alt is linear. To prove associativity, note that 
Lemma 9.6 gives 


(ef Ne*) ANe® =e ne® = eK mel net® =e! A (e4 Ne*). 


The general case follows from bilinearity. Similarly, using Lemma 9.6 again, 
we get 


el Ae? =e!4 = (sen r)e7! = (sgn tle? Ae’,” 


where 7 is the permutation that sends IJ to JI. It is easy to check that 
sent = (—1)", because 7 can be decomposed as a composition of kl trans- 
positions (each index of J must be moved past each of the indices of J). 
Anticommutativity then follows from bilinearity. 

Part (d) is an immediate consequence of Lemma 9.6 and induction. To 
prove part (e), we note that the special case in which each w’ is one of 
the basis covectors ¢') just reduces to (9.5). Since both sides of (9.6) are 


multilinear in (w!,...,w*), this suffices. 


Because of part (d) of this lemma, we will generally use the notations e/ 


and e A--- Ae interchangeably. 

The definition and computational properties of the wedge product can 
seem daunting at first sight. However, the only properties that you need 
to remember for most practical purposes are that it is bilinear, associative, 
and anticommutative, and satisfies (9.6). In fact, these properties determine 
the wedge product uniquely, as the following exercise shows. 


Exercise 9.3. Show that the wedge product is the unique associative, bi- 
linear, and anticommutative map A*(V) x A'(V) = A**!(V) satisfying (9.6). 


As we observed at the beginning of this section, one could also define 
the wedge product without the unwieldy coefficient of (9.2). Many authors 
choose this alternative definition of the wedge product, which we denote 
by A: 


win = Alt(w®n). (9.7) 


Using this definition, (9.3) is replaced by 


kl! 
Ix .J 1 
A = 
e NE a nie 
and (9.6) is replaced by 
wiA---Aw*(Xy,...,Xk) = Zq det(w'(X;)) (9.8) 
whenever w!,...,w" are covectors, as you can check. 
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Because of (9.6), we will call the wedge product defined by (9.2) the 
determinant convention for the wedge product, and the wedge product 
defined by (9.7) the Alé convention. Although the definition of the Alt 
convention is perhaps a bit more natural, the computational advantages of 
the determinant convention make it preferable for most applications, and 
we will use it exclusively in this book. 


Differential Forms on Manifolds 


Now we turn our attention to an n-dimensional smooth manifold M. The 
subset of T*M consisting of alternating tensors is denoted by A*M: 


AKM = [J A*(q)M). 
pEeM 


Exercise 9.4. Show that A*M is a smooth subbundle of T*M, and is 
therefore a smooth vector bundle of rank (7) over M. 


A smooth section of A’ M is called a differential k-form, or just a k-form; 
this is just a smooth tensor field whose value at each point is an alternating 
tensor. We denote the vector space of sections of A*M by A*(M). (We 
ordinarily denote the space of sections of a vector bundle by the upper- 
case script letter corresponding to the name of the bundle; in this case, we 
use A because of the typographical similarity between A and A.) 

In any coordinate chart, a k-form w can be written locally as 


/ ¢ s / 
w= So wrde A+. Ada* =~ wrds", 
T T 


where the coefficients wy are smooth functions defined on the coordinate 
neighborhood, and we use dz! as an abbreviation for dx“ A\---Ada** (not 
to be mistaken for the differential of a function x’). In terms of differential 
forms, the result of Lemma 9.4(c) translates to 


ae! No nas’ ( ne g ) =8 


Oxi?" "? Oxrde 


The wedge product of a k-form with an [-form is a (&k + 1)-form. A 0-form 
is just a real-valued function, and we interpret the wedge product f A 7 of 
a 0-form f with a k-form 7 to mean the product 7. 


Example 9.8. A 1-form is just a smooth covector field. On R?, some ex- 
amples of 2-forms are given by 


w = (sin zy) dy A dz; 
n = dx \ dy + dx A dz + dy dz. 
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Every n-form on R” is a smooth function times dz! A --- A dx”, because 
there is only one increasing multi-index of length n. 


If F: M — N is a smooth map and w is a differential form on N, the 
pullback F*w is a differential form on M, defined as for any smooth tensor 
field: 


F*w(X1,...,X~) =W(PX,..., Xp). 
In particular, if 1: N — M is the inclusion map of an immersed submani- 
fold, then we usually use the notation w|, for v*w. 
Lemma 9.9. Suppose F: M — N is smooth. 
(a) F*(w An) = (F*w) A (F*n). 


(b) In any coordinate chart, 


F* (So era Arts A ay") 
I 
= So wre F) diy? o F) A+ Ady"* oF). 
iE 


Exercise 9.5. Prove this lemma. 


This lemma gives a computational rule for pullbacks of differential forms 
similar to the one we developed for arbitrary tensor fields in the preceding 
chapter. As before, it can also be used to compute the expression for a 
differential form in another coordinate chart. 


Example 9.10. Let w be the 2-form dz A dy on R?. Thinking of the trans- 
formation to polar coordinates x = rcos6@, y = rsin@ as an expression for 
the identity map with respect to different coordinates on the domain and 
range, we find 


w = dx A dy 
= d(rcos@) A d(rsin@) 
= (cos dr — rsin0@ dé) A (sin 6 dr + r cos 6 dé) 
= rcos? 0dr A dé — rsin? 6d A dr, 


where we have used the fact that drAdr = d@Ad@ = 0 by anticommutativity. 
Because dé A dr = —dr A dé, this simplifies to 


dx \ dy =rdr A dé. 


The similarity between this formula and the formula for changing a dou- 
ble integral from Cartesian to polar coordinates is striking. More generally, 
we have the following lemma: 
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Lemma 9.11. Let F: M — N be a smooth map between n-manifolds. If 
(x*) and (y’) are coordinates on open sets U C M andV CN, respectively, 
and u is a smooth function on V, then the following holds on UN F-1(V): 


OFI 
F* (udy' \---A dy”) = (wo F) det (=) da’ A+++ Adz”. (9.9) 


Proof. Because the fiber of AM is spanned by dx! A --- A dx” at each 
point, it suffices to show that both sides of (9.9) give the same result when 
evaluated on (0/0z',...,0/0x”). From Lemma 9.9, 


F* (udy' A--- Ady") = (uo F)dF! \--»AdF”. 


Proposition 9.7(e) shows that 


0 0 [Od OFI 
Ta... mp 7 we fi Le eee = 
ae nn dt (s2oy22) = at (art (<2) = cee (2%), 


Therefore, the left-hand side of (9.9) gives (uo F’) det(OF? /Ox") when ap- 
plied to (0/0z',...,0/0x"). On the other hand, the right-hand side gives 
the same thing, because dz! \ --- A dx"(0/0z",...,0/0x") = 1. 


Exterior Derivatives 


In this section, we define a natural differential operator on forms, called the 
exterior derivative. It is a generalization of the differential of a function. 

To give some idea where the motivation for the exterior derivative comes 
from, let us look back at a question we addressed in Chapter 4. Recall 
that not all smooth covector fields are differentials of functions: Given w, 
a necessary condition for the existence of a function f such that w = df is 
that w be closed, which means that it satisfies 


Ox? = Oxd 


=0 (9.10) 


in every coordinate system. Since this is a coordinate-independent property 
by Lemma 4.25, one might hope to find a more invariant way to express it. 
The key is that the expression in (9.10) is antisymmetric in the indices 7 
and j, so it can be interpreted as the 7j-component of an alternating tensor 
field, i.e., a 2-form. We will define a 2-form dw by 


= Ow; Ow; 4 j 
aw = (F - pot) ae A dx’, 


i<j 


so it follows that w is closed if and only if dw = 0. 
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This formula has a significant generalization to differential forms of all 
degrees. For any manifold, we will show that there is a differential operator 
d: Ak(M) — A**1(M) satisfying d(dw) = 0 for all w. Thus it will follow 
that a necessary condition for a k-form w to be equal to dy for some (k—1)- 
form 7 is that dw = 0. 

The definition of d in coordinates is straightforward: 


a(S write") = S~ dwt A dz, (9.11) 
T ii 


where dwy is just the differential of the function w;. In somewhat more 
detail, this is 


a(S wrde" Ave ac’) = Sy ot! A dx™ A+++ Ada. 
I I a 


Observe that when w is a 1-form, this becomes 


: Ow; 7 ; Ow; Ow; if ; 
d(w; dx?) = da! \ de! = De (2 = a) dx’ \ dx, 
VJ 
using the fact that dx’ A dri = —dx/ A dz’, so this is consistent with our 


earlier definition. 
Proving that this definition is independent of the choice of coordinates 
takes a little work. This is the content of the next theorem. 


Theorem 9.12 (The Exterior Derivative). On any smooth manifold 
M, there is a unique linear map d: A*(M) — A**!(M) for each k > 0 
satisfying the following conditions: 


(i) If f is a smooth function (a 0-form), then df is the differential of f, 
defined as usual by 


(ii) If w € A¥(M) and n € A'(M), then 
d(w An) = dw An+(—-1)*w A dn. 


(iii) d? =0. More precisely, for any k-form w, d(dw) = 0. 
In any local coordinates, d is given by (9.11). 


Proof. First we will prove uniqueness. Suppose d: A*(M) — A**!(M) is 
a linear operator satisfying (i), (ii), and (iii). We will show that it also sat- 
isfies (9.11) in any local coordinate chart, which implies that it is uniquely 
determined. 
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We begin by showing that d is local, in the following sense: If w and w are 
k-forms on M that agree on an open subset U C M, then dw = dw on U. 
Writing 7 = @ —w, it clearly suffices to show that dy = 0 on U if 7 vanishes 
on U. Let p € U be arbitrary, and let yp € C™(M) be a bump function 
that is equal to 1 in a neighborhood of p and supported in U. Then yy is 
identically zero on M, so 


0 = d(pn)p = dep \ Mp + 9(P)dnp = an, 
because y = 1 in a neighborhood of p. Since p was an arbitrary point of U, 
this shows that dy = 0 on U. 

Suppose (x') are coordinates on an open subset U C M. Let w € A*(M), 
and write w = >>; wydz! in coordinates on U. We will show that (9.11) 
holds at each point p € U. By the extension lemma (Lemma 2.20), we 
can extend the coordinate functions x* to smooth functions Z* on all of 
M that agree with x’ in some neighborhood of p. Likewise, we extend the 
component functions w7; to functions w; on M. The k-form w = ae wdz" A 
-.- A dz*® is globally defined on M and agrees with w near p. Using linearity 
of d together with (i) and (ii), we compute 


dw = a(S Brae" Nowe A aa) 
I 
= So diz A db N+ N dB + (-1)°Q Grd (a A+ Nd), 
- I 


because Wy; is a 0-form. Now using (ii) again, the last term expands into a 
sum of terms, each of which contains a factor of the form d(dz*”), which is 
zero by (iii). Therefore, since d is local, 


/ i 3 / 7 _ 
diy = dip = N° diit|p A dX A-- A dti* = S~ dwr|p Ada A--- AN dai. 


I I 
Since p was arbitrary, (9.11) holds on all of U. This proves that dw is 
uniquely determined. 

Now to prove that such an operator exists, we begin by assuming that 
M is covered by a single coordinate chart, and define d by (9.11). It is 
clearly linear and satisfies (i). We need to check that it satisfies (ii) and 
(iii). Before doing so, we need to know that d satisfies 


d(f dx” A-+» A dx**) = df \dx™ A+» \ dat 
for any multi-index J, not just increasing ones. If J has repeated indices, 


then clearly both sides are zero. If not, let o be the permutation sending I 
to an increasing multi-index J. Then 


d(f dx” \---Adz**) = (sgna)d(f dx” A --- A dai) 
= (sena)df A dx \ +++ A dxt* 
= df Adx®™ A--- Adz". 
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To prove (ii), it suffices to consider terms of the form w = fdzx! = 
fdx®™ \-+»Adx** and n = gdx’. We compute 


d(w An) = d((f dx’) A (gdx”)) 
=d(fgdz' A dx’) 
= (gdf + f dg) Adz! A dx? 
= (df \ dx’) A (g dx’) + (—1)*(f dz") A (dg A dx”) 
= dw An+(-1)*w A dn, 
where the (—1)* comes from dg A dx’ = (—1)*dz! A dg. 


We will prove (iii) first for the special case of a 0-form, i.e., a smooth 
real-valued function. In this case, 


OF x 
d(df) =d (sh ae! 


af a j 
+ Ong cn § 
44 \ Ox'OxI = OxI Ox? 
VJ 


) dx’ \ dx 


For the general case, we use the k = 0 case together with (ii) to compute 
d(dw) = a(S der Adz™ A.A ac’) 
I 
= S~d(duy) Ada” A+++ dav'* 
I 


k 
+ S~ o(- dor A dx A+++ A d(dat3) A+++ A dat 
I j=l 


= 0. 


Finally, we consider the case of an arbitrary manifold M. On any coordi- 
nate domain U C M, we have a unique linear operator dy defined as above 
and satisfying (i)—(iii). On any set U MU’ where two charts overlap, the 
restrictions of dyw and dy:w to UNU’ must agree by uniqueness; therefore, 
defining dw by (9.11) in each coordinate chart, we get a globally-defined 
operator satisfying (i)—(iii). 


The operator d whose existence and uniqueness are asserted in this theo- 
rem is called exterior differentiation, and dw is called the exterior derivative 
of w. The exterior derivative of a real-valued function f is, of course, just 
its differential df. 
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Example 9.13. Let us work out the exterior derivatives of arbitrary 1- 
forms and 2-forms on R’. Any 1-form can be written 


w= Pdx+Qdy+ Rdz 


for some smooth functions P,Q, R. Using (9.11) and the fact that the wedge 
product of any 1-form with itself is zero, we compute 


dw =dPAdx+dQAdy+dRA dz 
= (Sea are sts) Adz 4 (Sear + re saz) A dy 
xv Yy z 


dx dy dz a a a 
(Grae | ty st) A dz 

= (2-5) da A dy+ (= -s) dz \ dx 
+ (SE - ) ayaa. 


It is interesting to note that the components of this 2-form are exactly the 
components of the curl of the vector field with components (P, Q, R) (except 
perhaps in a different order and with different signs). We will explore this 
connection in more depth later in the book. 

An arbitrary 2-form on R? can be written 


w=adxr\dy+ Bdx \ dz+ydyA dz. 
A similar computation shows 


dw = (Se - Ft P) ae nay nae. 


One important feature of the exterior derivative is that it behaves well 
with respect to pullbacks, as the next lemma shows. 


Lemma 9.14. If G: M — N is a smooth map, then the pullback map 
G*: AF(N) = A*(M) commutes with d: For allw € A*(N), 
G* (dw) = d(G*w) (9.12) 


Proof. Let w € A*(N) be arbitrary. Because d is local, if (9.12) holds in 
a neighborhood of each point, then it holds on all of M. In a coordinate 
neighborhood, w can be written as a sum of terms like f dr® A--- A dx**, 
so by linearity it suffices to check (9.12) for a form of this type. 

For such a form, the left-hand side of (9.12) is 


G*d(f dx A-+» A dx*) = G*(df Ada A-+-A dz) 
= d(foG)Ad(a" oG)A--+Ad(x** 0G), 
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while the right-hand side is 
dG* (f dx” A--- A da) = d((f 0G) d(a® 0G) A---Ad(x* oG)) 
= d(foG)Ad(a" oG)A-++Ad(z** 0G). 


Extending the terminology that we introduced for covector fields in 
Chapter 4, we say that a differential form w € A*(M) is closed if dw = 0, 
and exact if there exists a (k — 1)-form 7 on M such that w = dy. The fact 
that d? = 0 implies that every exact form is closed. The converse may not 
be true, as we saw already in Chapter 4 the case of covector fields. We will 
return to these ideas in Chapter 11. 


Symplectic Forms 


In this section, we introduce symplectic forms, a special kind of 2-form 
that plays a leading role in many applications of smooth manifold theory 
to analysis and physics. 

We begin with some linear algebra. Recall that a 2-tensor w on a finite- 
dimensional real vector space V is said to be nondegenerate if w(X,Y) = 0 
for all Y € V implies X = 0. 


Exercise 9.6. Show that the following are equivalent for a 2-tensor w on 

a finite-dimensional vector space V: 

(a) w is nondegenerate. 

(b) The matrix (w;;) representing w in terms of any basis is nonsingular. 

(c) The linear map w: V — V* defined by @(X)(Y) = w(X,Y) is invert- 
ible. 


A nondegenerate alternating 2-tensor is called a symplectic tensor. A vec- 
tor space V endowed with a specific symplectic tensor is called a symplectic 
vector space. 


Example 9.15. Let V be a vector space of dimension 2n. Choose any basis 
for V, and denote the basis by (Aj, Bi,..., An, Bn) and the corresponding 
dual basis for V* by (at, 31,...,a”, 8”). Let w € A?(V) be the 2-covector 
defined by 


w= So oA (9.13) 
i=l 


Note that the action of w on basis vectors is given by 


w(A;,A;) = w(B;, B;) = 0. (9.14) 
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Suppose X = a’ A; + b'B; € V satisfies w(X,Y) = 0 for all Y € V. Then 
0 = w(X, Bj) = a’ and 0 = w(X, Aj) = —b', which implies that X = 0. 
Thus w is nondegenerate. 


It is useful to consider the special case in which dim V = 2. In this case, 
every 2-covector is a multiple of a! A Z', which is nondegenerate by the 
argument above. Thus every nonzero 2-covector on a 2-dimensional vector 
space is symplectic. 

If (V,w) is a symplectic vector space and S C V is any subspace, we 
define the symplectic complement of S, denoted by S+, to be the subspace 


St ={X €V:0(X,Y) =0 for all Y € S}. 


As the notation suggests, the symplectic complement is analogous to the 
orthogonal complement in an inner product space. For example, just as in 
the inner product case, the dimension of S+ is the codimension of S, as 
the next lemma shows. 


Lemma 9.16. Let (V,w) be a symplectic vector space. For any subspace 
SCV, dimS+dimS+ =dimV. 


Proof. If (&,...,E%) is any basis for S$, then the covectors 
w(F),...,@(E;,) C V* are independent because w is injective. Therefore, 
the map ®: V — R* given by 


O(X) = (W(F1)(X),-.-,@(Ex)(X)) 


is surjective, so S+ = Ker © has dimension equal to dim V — dim R* by the 
rank-nullity law. 


Symplectic complements differ from orthogonal complements in one im- 
portant respect: Although it is always true that SM $+ = {0} in an inner 
product space, this need not be true in a symplectic vector space. Indeed, 
if S is 1-dimensional, the fact that w is alternating forces w(X, X) = 0 for 
every X € S,so S C S+. Carrying this idea a little further, subspaces of 
V can be classified in the following way. A subspace S' C V is said to be 


e symplectic if SA $+ = {0}; 
e isotropic if S Cc S+; 

e coisotropic if S > S+; 

e Lagrangian if S = S+. 


Exercise 9.7. Let (V,w) be a symplectic vector space, and let S C V be 
a subspace. 


(a) Show that (S+)+ = S. 
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(b) Show that S' is symplectic if and only if w|s is nondegenerate. 
(c) Show that S' is isotropic if and only if w|s = 0. 
(d) Show that S' is Lagrangian if and only if w|s =0 and dimS = n. 


The symplectic tensor w defined in Example 9.15 turns out to be the 
prototype of all symplectic tensors, as the next proposition shows. This 
can be viewed as a symplectic version of the Gram-Schmidt algorithm. 


Proposition 9.17 (Canonical Form for a Symplectic Tensor). Let 
w be a symplectic tensor on an m-dimensional vector space V. Then V has 
even dimension m = 2n, and there exists a basis for V in which w has the 
form (9.13). 


Proof. It is easy to check that w has the form (9.13) with respect to a basis 
(Ai, Bi,...,An, Bn) if and only if the action of w on basis vectors is given 
by (9.14). Thus we will prove the theorem by induction on m = dim V, by 
showing that there exists a basis of this form. 

For m = 0 there is nothing to prove. Suppose (V,w) is a symplectic 
vector space of dimension m > 1, and assume the proposition is true for all 
symplectic vector spaces of dimension less than m. Let A; be any nonzero 
vector in V. Since w is nondegenerate, there exists B, € V such that 
w(A1,B,) £0. Multiplying B, by a constant if necessary, we may assume 
that w(Ai, B,) = 1. Because w is alternating, B,; cannot be a multiple of 
Aj, so the set {A;, B,} is independent. 

Let S Cc V be the subspace spanned by {A1, Bi}. Then dim $+ = m-—2 
by Lemma 9.16. Since w|s is obviously nondegenerate, by Exercise 9.7 
it follows that S is symplectic. This means SM S+ = {0}, so S+ is 
also symplectic. By induction $+ is even-dimensional and there is a ba- 
sis (Ag, Bo,..., An, Bn) for S+ such that (9.14) is satisfied for 2 < i,j <n. 
It follows easily that (A1, Bi, A2, Bo,...,An, Bn) is the required basis for 
V. 


Because of this proposition, if (V,w) is a symplectic vector space, a basis 
(Ai, Bi,...,An, Bn) for V is called a symplectic basis if (9.14) holds, which 
is equivalent to w being given by (9.13) in terms of the dual basis. The 
proposition then says that every symplectic vector space has a symplectic 
basis. 

Now let us turn to manifolds. A symplectic form on a smooth manifold 
is a closed, nondegenerate 2-form. In other words, a 2-form w is symplectic 
if and only if it is closed and w, is a symplectic tensor for each p € M. A 
smooth manifold endowed with a specific choice of symplectic form is called 
a symplectic manifold. A choice of symplectic form is also sometimes called 
a symplectic structure on M. Proposition 9.17 implies that a symplectic 
manifold must be even-dimensional. If (M/,w) and (M,W) are symplectic 


manifolds, a diffeomorphism F': M — M satisfying F*® = w is called a 
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symplectomorphism. The study of properties of symplectic manifolds that 
are invariant under symplectomorphisms is known as symplectic geometry. 


Example 9.18 (Symplectic Manifolds). 


(a) If we denote the standard coordinates on R2” by (xt, y',...,2",y"), 
the 2-form 


w= dx’ A dy’ 
i=1 


is symplectic: It is obviously closed, and its value at each point is 
the standard symplectic tensor of Example 9.15. This is called the 
standard symplectic structure (or standard symplectic form) on R?”. 


(b) Suppose © is any smooth 2-manifold and (2 is any nonvanishing 2- 
form on %. Then 2 is closed because dQ. is a 3-form, and every 3-form 
on a 2-manifold is zero. Moreover, as we observed above, every nonva- 
nishing 2-form is nondegenerate, so (X,{2) is a symplectic manifold. 


Suppose (M,w) is a symplectic manifold. An (immersed or embedded) 
submanifold N Cc M is said to be symplectic, isotropic, coisotropic, or 
Lagrangian if T,.N (thought of as a subspace of T,M) has this property 
at each point p € N. More generally, an immersion f’': N — M is said to 
have one of these properties if the subspace F.(ZpN) C Try) M has the 
corresponding property for every p € N. Thus a submanifold is symplectic 
(isotropic, etc.) if and only if its inclusion map has the same property. 


Exercise 9.8. Suppose (M,w) is a symplectic manifold, and F': N — M 
is an immersion. Show that F is isotropic if and only if F*w = 0, and F is 
symplectic if and only if F*w is a symplectic form. 


The most important example of a symplectic manifold is the total space 
of the cotangent bundle of any smooth manifold M, which carries a canon- 
ical symplectic structure that we now define. First, there is a natural 1- 
form 7 on the total space of T*M, called the tautologous 1-form, defined 
as follows. A point in T*M is a covector n € TM for some p € M; we 
will denote such a point by the notation (p,7). The natural projection 
a: T*M — M is then just m(p,7) = p, and its pullback is a linear map 
n*: TSM — To n)(T*M). We define 7 € Al(T*M) by 


T(p,n) = 7 N- 


In other words, the value of r at (p,7) € T*M is the pullback with respect 
to 7 of the covector 7 itself. If X is a tangent vector in T(),»)(T*M), then 


T(p,n)(X) = (+X). 
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Proposition 9.19. Let M be a smooth manifold. The tautologous 1-form 
T is smooth, and w = —dr is a symplectic form on the total space of T*M. 


Proof. Let (x*) be any coordinates on M, and let (x, €;) denote the corre- 
sponding standard coordinates on T*M as defined in Proposition 4.4. Re- 
call that the coordinates of (p,7) € T*M are defined to be (2', €;), where 
(x*) is the coordinate representation of p and €; dx’ is the coordinate repre- 
sentation of 7. In terms of these coordinates, the projection 7: T*M — M 
has the coordinate expression m(z,€) = x, and therefore the coordinate 
representation of 7 is 


Tip =m (ede) = Cede" 


It follows immediately that 7 is smooth, because its component functions 
are linear. 
Clearly w is closed, because it is exact. Moreover, 


w=-dr= S > dx ndé. 


Under the identification of an open subset of T*M with an open sub- 
set of R?” by means of these coordinates, w corresponds to the standard 
symplectic form on R?” (with €; substituted for y’). It follows that w is 
symplectic. 


The symplectic structure defined in this proposition is called the canoni- 
cal symplectic structure on T*M. One of its many uses is in giving a some- 
what more “geometric” picture of what it means for a 1-form to be closed, 
as shown by the following proposition. 


Proposition 9.20. Let M be a smooth manifold, and let a be a 1-form on 
M. Thought of as a smooth map from M to T*M, o is an embedding, and 
a is closed if and only if its image o(M) is a Lagrangian submanifold of 
T*M. 


Proof. Throughout this proof, we need to remember that o is playing two 
roles: On the one hand, it is a 1-form on M, and on the other hand, it is 
a smooth map between manifolds. Since they are literally the same map, 
we will not use different notations to distinguish between them; but you 
should be careful to think about which role a is playing at each step of the 
argument. 

In terms of any local coordinates (a) for M and the corresponding stan- 
dard coordinates (x, €;) for T*M, the map o: M — T*M has the coordi- 
nate representation 
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where o; dx* is the coordinate representation of o as a 1-form. It follows 
immediately that o is an immersion, and the fact that it is injective follows 
from 700 = Idy. 

To show that it is an embedding, it suffices by Proposition 5.4 to show 
that it is a proper map. This follows easily from the fact that moo = Idy: If 
K Cc T*M isa compact set, then o~'(K) is a closed subset of the compact 
set (4), and so is compact. 

Because o(M) is n-dimensional, it is Lagrangian if and only if it is 
isotropic, which is the case if and only if o*~w = 0. The pullback of the 
tautologous form 7 under ga is 


or=a(& da’) =o;dv' =o. 
This can also be seen somewhat more invariantly from the computation 
(0°7) p(X) = To(p) (O%X) = Op(T+o2X) = op(X), 


which follows from the definition of 7 and the fact that too = Idyy. 
Therefore, 


ow = —o'drt = —d(o*T) = —do. 


It follows that o is a Lagrangian embedding if and only if do = 0. 
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9-1. Let v1,...,Un be any n vectors in R”, and let P be the n-dimensional 
parallelepiped spanned by them: 


P= {tru +--+ +thtn :0<t; <1}. 
Show that Vol(P) = det(v1,..., Un). 


9-2. Let (e', e?,e%) be the standard dual basis for (R°)*. Show that e! @ 
e? @e° is not equal to a sum of an alternating tensor and a symmetric 


tensor. 
9-3. Show that covectors w!,...,w* on a finite-dimensional vector space 


are linearly dependent if and only if wt A---Aw* =0. 


9-4. Show that two k-tuples {w!,...,w*} and {7!,...,7*} of independent 
covectors have the same span if and only if 


wirh--Aw*® =en' A---AnF 
for some nonzero real number c. 


9-5. A k-covector 7 on a finite-dimensional vector space V is said to be 
decomposable if it can be written 


where w!,...,w* are covectors. Is every 2-covector on V decompos- 


able? Your answer will depend on the dimension of V. 
9-6. Define a 2-form Q on R? by 
Q=adyAdz+ydzAdx+zdz A dy. 

(a) Compute Q in spherical coordinates (p,y,0) defined by 
(x,y, 2) = (sin ycos 0, sin y sin 0, cos ¢). 

(b) Compute dQ in both Cartesian and spherical coordinates and 
verify that both expressions represent the same 3-form. 

(c) Compute the restriction Q|g2 = e*, using coordinates (y, #), on 
the open subset where these coordinates are defined. 


(d) Show that Q|g2 is nowhere zero. 


9-7. In each of the following problems, g: M — N is a smooth map be- 
tween manifolds M and N, and w is a differential form on N. In each 
case, compute g*w and dw, and verify by direct computation that 


g* (dw) = d(g*w). 


226 


9. Differential Forms 


(a) g: R? — R? by 


(x,y) = g(s, t) = (st, e’); 
w= axdy. 


(b) g: {(r,0) : r > 0} — R? by 


(x, y) = (rcos@,rsin 6); 
w= dy Adz. 


(c) g: R? — R? by 


(x,y,z) = 9(9, ~) = ((cos yp + 2) cos 9, (cos y + 2) sin 9, sin y); 
w= ydzAdz. 


(d) g: {(u,v) su? +0? < 1} > R38 \ {0} by 
(x,y,z) = (u,v, V1 = w? = v?); 
w = (a? + y? + 27)-3/2 (a dy A dz + ydz A dx + zdx A dy). 
(e) 9: {(7,9,y) «7 > 0} > R® by 


(x,y, 2) = (rcos@sin y, r sin sin y, r cos vy); 
w=axdyA\dz+ydzAdxa+ zdzx dy. 


9-8. Let V be a finite-dimensional real vector space. We have two ways 


to think about the tensor space T*(V): concretely, as the space of 
k-multilinear functionals on V; and abstractly, as the tensor product 
space V* @--- ®@ V*. However, we have defined alternating and sym- 
metric tensors only in terms of the concrete definition. This problem 
outlines an abstract approach to alternating tensors. 


Let A denote the subspace of V* ®--- ®V™* spanned by all elements 
of the form a ®w ®w ® @ for covectors w and arbitrary tensors a, 3, 
and let A*(V*) denote the quotient vector space V* @--- @ V*/A. 
Define a wedge product on A*(V*) by wA7 = 1(@ @ 7), where 
mt: V* @-+»@V* > A*(V*) is the projection, and ®, 7 are arbitrary 
tensors such that 7(W) = w, 7(7) = 7. Show that this wedge product 
is well defined, and that there is a unique isomorphism F’: A*(V*) > 
A*(V) such that the following diagram commutes: 


& 


V*@---@V* TAY) 
7 Alt 
A*(V*) A*(V), 


9-13. 
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and show that F' takes the wedge product we just defined on A*(V*) 
to the Alt convention wedge product on A*(V). [This is one reason 
why some authors consider the Alt convention for the wedge product 
to be more natural than the determinant convention. It also explains 
why some authors prefer the notation A*(V*) instead of A*(V) for 
the space of alternating covariant k-tensors, since it is a quotient of 
the kth tensor product of V* with itself.] 


. Let (V,w) be asymplectic vector space of dimension 2n, and let S Cc V 


be a subspace. Show that V has a symplectic basis (A;, B;) with the 
following property: 
(a) If S is symplectic, S = span(Aj1, Bi,..., Ax, By) for some k. 
(b) If S is isotropic, S = span(A,,...,A,) for some k. 


(c) If S is coisotropic, S = span(A1,...,An, Bi,..., Bx) for some 
k. 


(d) If S is Lagrangian, S' = span(Aj,..., An). 


. Let M be asmooth manifold, and let N be an embedded submanifold 


of the total space of T*M. Show that M is the image of a closed 1- 
form on M if and only if N is Lagrangian and the projection map 
am: T*M — M restricts to a diffeomorphism N — M. 


. Show that there is no 1-form o on M such that the tautologous form 


rt € Al(T*M) is equal to the pullback 1*o. 


. Let (M,w) and (M,&) be symplectic manifolds. Define a 2-form Q 


on M x M by 
Q= rw 7G, 


where m: Mx M — M and%: MxM > M are the projections. For 
a smooth map F': M — N, let [(F) Cc M x M be the graph of F: 


T(F) ={(a,y) € Mx M:y= F(a)}. 


Show that F' is a symplectomorphism if and only if T'(£’) is a La- 
grangian submanifold of (M x M,Q). 


The (real) symplectic group is the subgroup Sp(2n,R) C GL(2n, R) 
of matrices leaving the standard symplectic form w = S7i_, dx’ Ady’ 
invariant, that is, the set of invertible linear maps A: R?” — R?” 
such that A*w = w. 


(a) Show that a matrix A is in Sp(2n, R) if and only if it takes the 
standard basis to a symplectic basis. 
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(b) Show that A € Sp(2n,R) if and only if A7 JA = J, where J is 
given in block form as 


0 Ln 
pe (_ pe ) | 
(c) Show that Sp(2n, R) is an embedded Lie subgroup of GL(2n, R), 
and determine its dimension. 


9-14. Let A, C G(n, 2n) denote the set of Lagrangian subspaces of R?”. 


(a) Show that Sp(2n, R) acts transitively on Ay. 


(b) Show that A, has a unique smooth manifold structure such that 
the action of Sp(2n, R) is smooth, and determine its dimension. 
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Integration on Manifolds 


We introduced differential forms in the previous chapter with a promise 
that they would turn out to be objects that can be integrated on manifolds 
in a coordinate-independent way. In this chapter, we fulfill that promise by 
defining the integral of a differential n-form over a smooth n-manifold. 

Before doing so, however, we need to address a serious issue that we have 
so far swept under the rug. This is the small matter of the positive and 
negative signs that arise when we try to interpret a k-form as a machine 
for measuring k-dimensional volumes. In the previous chapter, we brushed 
this aside by saying that the value of a k-form applied to k vectors has to 
be interpreted as a “signed volume” of the parallelepiped spanned by the 
vectors. These signs will cause problems, however, when we try to integrate 
differential forms on manifolds, for the simple reason that the transforma- 
tion law (9.9) for an n-form involves the determinant of the Jacobian, while 
the change of variables formula for multiple integrals involves the absolute 
value of the determinant. In the first part of this chapter, we develop the 
theory of orientations, which which is a systematic way to restrict to co- 
ordinate transformations with positive determinant, thus eliminating the 
sign problem. 

Next we address the theory of integration. First we define the integral 
of a differential form over a domain in Euclidean space, and then we show 
how to use diffeomorphism invariance and partitions of unity to extend this 
definition to the integral of a compactly supported k-form over an oriented 
k-manifold. The key feature of the definition is that it is invariant under 
orientation-preserving diffeomorphisms. 
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Next we prove one of the most fundamental theorems in all of differential 
geometry. This is Stokes’s theorem, which is a generalization of the funda- 
mental theorem of calculus, as well as of the three great classical theorems 
of vector analysis: Green’s theorem for vector fields in the plane; the di- 
vergence theorem for vector fields in space; and (the classical version of) 
Stokes’s theorem for surface integrals in R°. We also describe an extension 
of Stokes’s theorem to manifolds with corners, which will be useful in our 
treatment of the de Rham theorem in Chapter 11. 

In the last section of the chapter, we show how these ideas play out on a 
Riemannian manifold. In particular, we prove Riemannian versions of the 
divergence theorem and of Stokes’s theorem for surface integrals, of which 
the classical theorems are special cases. 


Orientations 


The word “orientation” has some familiar meanings from our everyday ex- 
perience, which can be interpreted as rules for singling out certain bases 
of R', R?, and R°. For example, most people would understand an “ori- 
entation” of a line to mean a choice of preferred direction along the line, 
so we might declare an oriented basis for R! to be one that points to the 
right (i.e., in the positive direction). A natural class of bases for R? is the 
ones for which the rotation from the first vector to the second is in the 
counterclockwise direction. And every student of vector calculus encoun- 
ters “right-handed” bases in R?: These are the bases (F,, E2, E3) with the 
property that when the fingers of your right hand curl from EF to Fe, your 
thumb points in the direction of F3. 

Although “to the right,” “counterclockwise,” and “right-handed” are not 
mathematical terms, it is easy to translate the rules for selecting oriented 
bases of R!, R?, and R® into rigorous mathematical terms: You can check 
that in all three cases, the preferred bases are the ones whose transition 
matrix from the standard basis has positive determinant. 

In an abstract vector space for which there is no canonical basis, we no 
longer have any way to determine which bases are “correctly oriented.” For 
example, if V is the space of polynomials of degree at most 2, who is to 
say which of the ordered bases (1, 2, x?) or (x7, 2,1) is “right-handed”? All 
we can say in general is what it means for two bases to have the “same 
orientation.” 

Thus we are led to introduce the following definition. Let V be a vector 


space of dimension n > 1. We say two ordered bases (£1,...,E,) and 
(Fy,...,£,) are consistently oriented if the transition matrix (Bs), defined 
by 


E, = BiE,, (10.1) 
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has positive determinant. 


Exercise 10.1. Show that being consistently oriented is an equivalence 
relation on the set of all ordered bases for V, and show that there are exactly 
two equivalence classes. 


If dim V = n > 1, we define an orientation for V as an equivalence class 
of ordered bases. If (£1,..., En) is any ordered basis for V, we denote the 
orientation that it determines by [F,..., £,]. A vector space together with 
a choice of orientation is called an oriented vector space. If V is oriented, 
then any ordered basis (F1,...,,) that is in the given orientation is said 
to be oriented or positively oriented. A basis that is not oriented is said to 
be negatively oriented. 

For the special case of a 0-dimensional vector space V, we define an 
orientation of V to be simply a choice of one of the numbers +1. 


Example 10.1. The orientation [e1,...,¢€,] of R” determined by the stan- 
dard basis is called the standard orientation. You should convince yourself 
that, in our usual way of representing the axes graphically, an oriented ba- 
sis for R is one that points to the right; an oriented basis for R? is one for 
which the rotation from the first vector to the second is counterclockwise; 
and an oriented basis for R® is a right-handed one. (These can be taken 
as mathematical definitions for the words “right,” “counterclockwise,” and 
“right-handed.” ) The standard orientation for R° is defined to be +1. 


There is an important connection between orientations and alternating 
tensors, expressed in the following lemma. 


Lemma 10.2. Let V be a vector space of dimension n > 1, and suppose 2 
is a nonzero element of A"(V). The set of ordered bases (Fy,..., Fn) such 
that Q(F4,..., En) > 0 is an orientation for V. 


Proof. Let Og denote the set of ordered bases on which Q gives positive 
values. We need to show that Og is exactly one equivalence class. 

Choose one basis (£1,..., Ey) such that Q(£1,..., En) > 0. (Such a basis 
can always be found by starting with an arbitrary basis and replacing EF by 
—F, if necessary.) Let (e1,...,€”) denote the dual basis. Since eA -+-Ae” 
is a basis for A”(V), there is some (necessarily positive) number c such that 
C= ce heres. 


Now if (F,...,£,) is any other basis, with transition matrix (Ai) de- 
fined by 
E; = A}E,, 
we have 


O(F1,..., En) = cel A+++ Ae™(Et,..., En) 
= cdet(e*(E;)) 
= cdet(A¥). 
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It follows that (E;) is consistently oriented with (£;) if and only if 
Q(E,,...,£n) > 0, and therefore Og = [F1,..., En]. 


If V is an oriented vector space and Q is an n-covector that determines 
the orientation of V as described in this lemma, we say that 9 is an oriented 
(or positively oriented) n-covector. For example, the n-covector e+ A--+Ae” 
is positively oriented for the standard orientation on R”. 


Orientations of Manifolds 


Let M be a smooth manifold. We define a pointwise orientation on M to 
be a choice of orientation of each tangent space. By itself, this is not a very 
useful concept, because the orientations of nearby points may have no rela- 
tion to each other. For example, a pointwise orientation on R” might switch 
randomly from point to point between the standard orientation and its op- 
posite. In order for orientations to have some relationship with the smooth 
structure, we need an extra condition to ensure that the orientations of 
nearby tangent spaces are consistent with each other. 

Suppose M is a smooth n-manifold with a given pointwise orientation. 
Recall that a local frame for M is an n-tuple of smooth vector fields 
(£,,...,E,) on an open set U C M such that (£;|,) forms a basis for T,M 
at each p € U. We say that a local frame (F;) is (positively) oriented if 
(Ei|p,---,En|p) is a positively oriented basis for TM at each point p € U. 
A negatively oriented frame is defined analogously. 

A pointwise orientation is said to be continuous if every point is in the 
domain of an oriented local frame. An orientation of M is a continuous 
pointwise orientation. An oriented manifold is a smooth manifold together 
with a choice of orientation. We say M is orientable if there exists an 
orientation for it, and nonorientable if not. 

If M is 0-dimensional, this definition just means that an orientation of 
M is a choice of +1 attached to each of its points. The local constancy 
condition is vacuous in this case, and the notion of oriented frames is not 
useful. Clearly every 0-manifold is orientable. 


Exercise 10.2. If M is an oriented manifold of dimension n > 1, show 
that every local frame with connected domain is either positively oriented 
or negatively oriented. 


The next two propositions give ways of specifying orientations on mani- 
folds that are somewhat more practical to use than the definition. A smooth 
coordinate chart (U, vy) is said to be (positively) oriented if the coordinate 
frame (0/0z") is positively oriented, and negatively oriented if the coordi- 
nate frame is negatively oriented. A collection of charts {(Ua, a)} is said 
to be consistently oriented if for each a, 3, the transition map yg oy, has 
positive Jacobian determinant everywhere on ya (Ua MN Uz). 
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Proposition 10.3. Let M be a smooth positive-dimensional manifold, and 
suppose we are given an open cover of M by consistently oriented charts 
{(Ua, Ga)}. Then there is a unique orientation for M with the property that 
each chart Yq is oriented. Conversely, if M is oriented, then the collection 
of all oriented charts is a consistently oriented cover of M. 


Proof. For any p € M, the consistency condition means that the transition 
matrix between the coordinate bases determined by any two of the charts 
in the given collection has positive determinant. Thus the coordinate bases 
for all of the given charts determine the same orientation on T,M. This 
defines a pointwise orientation on M. Each point of M is in the domain of 
at least one of the given charts, and the corresponding coordinate frame 
is oriented by definition, so this pointwise orientation is continuous. The 
converse is similar, and is left as an exercise. 


Exercise 10.3. Complete the proof of Proposition 10.3. 


Proposition 10.4. Let M be a smooth manifold of dimension n > 1. A 
nonvanishing n-form Q € A"(M) determines a unique orientation of M 
for which Q is positively oriented at each point. Conversely, if M is given 
an orientation, then there is a nonvanishing n-form on M that ts positively 
oriented at each point. 


Remark. Because of this proposition, any nonvanishing n-form on an n- 
manifold is called an orientation form. If M is an oriented manifold and Q 
is an orientation form determining the given orientation, we also say that 2 
is (positively) oriented. It is easy to check that if Q and 2 are two positively 
oriented forms on the same orientated manifold M, then Q= fQ for some 
strictly positive smooth function f. 

If M is a 0-manifold, the proposition remains true if we interpret an 
orientation form as a nonvanishing function Q, which assigns the orientation 
+1 to points where 2 > 0 and —1 to points where 2 < 0. 


Proof. Let Q be a nonvanishing n-form on M. Then 2 defines a pointwise 
orientation by Lemma 10.2, so all we need to check is that it is continuous. 
Let (a) be any local coordinates on a connected domain U Cc M. Writing 
Q = fdz' A---Adzx” on U, the fact that Q is nonvanishing means that f 
is nonvanishing, and therefore 


) ) 
0 (Been ger) =sA0 


at all points of U. Since U is connected, it follows that this expression is 
either always positive or always negative on U, and therefore the coordi- 
nate chart is either positively oriented or negatively oriented. If negatively, 
we can replace x! by —z! to obtain a new coordinate chart for which the 
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coordinate frame is positively oriented. Thus the pointwise orientation de- 
termined by 2 is continuous. 

Conversely, suppose M is oriented. Let {(Ua,~a)} be the collection of 
all oriented charts for M. In each coordinate domain U,, the n-form Q, = 
dz! \ --. \ dx” is positively oriented. Let {wa} be a partition of unity 
subordinate to the cover {U,}, and define 


Q= > pale. 


By the usual argument, 2 is a smooth n-form on M. To complete the proof, 
we need to show that Q never vanishes. 

Let p € M be arbitrary, and let (£1,...,E,) be an oriented basis for 
T,M. For each a such that p € Ua, we have Qa|p(£1,..., En) > 0. Since 
wa(p) = 0 for all other @ and there is at least one a for which Wa(p) > 0, 
we have 


(Fay) En)= > Palp)Qalp(Fi,.--,En) > 0, 
{a:pEeUa} 


Thus Q, 4 0. 


Exercise 10.4. Show that any open subset of an orientable manifold is 
orientable, and any product of orientable manifolds is orientable. 


Recall that a smooth manifold is said to be parallelizable if it admits a 
global frame. 


Proposition 10.5. Every parallelizable manifold is orientable. 


Proof. Suppose M is parallelizable, and let (F,,..., £,) be a global frame 
for M. Define a pointwise orientation by declaring (£1|p,...,En|p) to be 
positively oriented at each p € M. This pointwise orientation is continuous, 
because every point of M is in the domain of the (global) oriented frame 


Example 10.6. The preceding proposition shows that Euclidean spaces 
R”, the n-torus T”, the spheres S! and S?, and products of them are all 
orientable, because they are all parallelizable. Therefore any open subset 
of one of these manifolds is also orientable. 


Let M and N be oriented positive-dimensional manifolds, and let 
F: M—N bea local diffeomorphism. We say F is orientation-preserving 
if for each p € M, F, takes oriented bases of T,M to oriented bases of 
Trip) N, and orientation-reversing if it takes oriented bases of T,M to neg- 
atively oriented bases of Trip) N. 


Exercise 10.5. Show that a smooth map F: M — N is orientation- 
preserving if and only if its Jacobian matrix with respect to any oriented 
coordinate charts for M and N has positive determinant, and orientation- 
reversing if and only if it has negative determinant. 


Orientations of Hypersurfaces 235 
Orientations of Hypersurfaces 


If M is an oriented manifold and N is a submanifold of M, N may not 
inherit an orientation from M, even if N is embedded. Clearly it is not 
sufficient to restrict an orientation form from M to N, since the restriction 
of an n-form to a manifold of lower dimension must necessarily be zero. A 
useful example to consider is the Mobius band, which is not orientable (see 
Problem 10-5), even though it can be embedded in R®. 

In this section, we will restrict our attention to (immersed or embedded) 
submanifolds of codimension 1, commonly called hypersurfaces. With one 
extra piece of information (a certain kind of vector field along the hyper- 
surface), we can use an orientation on M to induce an orientation on any 
hypersurface N c M. 

We start with some definitions. Let V be a finite-dimensional vector 
space, and let X € V. We define a linear map ix: A*V — A*~!V, called 
interior multiplication or contraction with X, by 


ixw(¥i,..-,¥e—-1) = w(X,Yi,..., Ye-1). 


In other words, 7xw is obtained from w by inserting X into the first slot. 
By convention, we interpret ixw to be zero when w is a 0-covector (i.e., a 
number). Another common notation is 


X Iw =ixw. 


Interior multiplication shares two important properties with exterior dif- 
ferentiation: They are both antiderivations whose square is zero, as the 
following lemma shows. 


Lemma 10.7. Let V be a finite-dimensional vector space and X EV. 
(a) ax ie) ax = 0. 
(b) ix ts an antiderivation: If w is a k-covector and y is an l-covector, 


ix(w An) = (ixw) Ant (-1)'w A (ixn). 


Proof. On k-covectors for k > 2, part (a) is immediate from the definition, 
because any alternating tensor gives zero when two of its arguments are 
identical. On 1-covectors and 0-covectors, it follows from the fact that ix = 
0 on 0-covectors. 

To prove (b), it suffices to consider the case in which both w and 7 
are wedge products of covectors (such an alternating tensor is said to be 
decomposable), since every alternating tensor can be written locally as a 
linear combination of decomposable ones. It is easy to verify that (b) will 
follow in this special case from the following general formula for covectors 
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k 
X a(wi A+ Aw*) = S0(-1) Twi (X)wt A+ Awt A+ -Aw*, (10.2) 
w=1 


where the hat indicates that w’ is omitted. 
To prove (10.2), let us write X; = X and apply both sides to vectors 
(Xo,...,X%); then what we have to prove is 


(wi A---Aw*)(Xy,...,Xk) 


(1)? twi(Xp)(wt A. Awi A+++ Aw*)(Xo,...,Xe). (10.3) 


I 
M- 


The left-hand side of (10.3) is the determinant of the matrix X whose 
(i, j)-entry is w’(X;). To simplify the right-hand side, let X{ denote the 
(k — 1) x (k — 1) minor of X obtained by deleting the ith row and jth 
column. Then the right-hand side of (10.3) is 


k 
S 0 (-1)1w"(X1) det Xi. 


i=l 


This is just the expansion of det X by minors along the first column 
(w!(X1),...,w*(X1)), and therefore is equal to det X. 


It should be noted that when the wedge product is defined using the Alt 
convention, interior multiplication has to be defined with an extra factor 
of k: 


ixw(Y1, os .;Ye-1) i kw(X, Yi, lend Vee): 


This definition ensures that interior multiplication 7x is still an antideriva- 
tion; the factor of k is needed to compensate for the difference between the 
factors of 1/k! and 1/(&—1)! that occur when the left-hand and right-hand 
sides of (10.3) are evaluated using the Alt convention. 

On a smooth manifold M, interior multiplication extends naturally to 
vector fields and differential forms, simply by letting 7x act pointwise: if 
X €T(M) and w € A*(M), define a (k — 1)-form X 1w = ixw by 


(X 1W)p = Xp Ip. 


Exercise 10.6. If X is a smooth vector field and w is a differential form, 
show that X Iw is smooth. 


Now suppose M is a smooth manifold and S C M is a hypersurface 
(immersed or embedded). A vector field along S is a continuous map 
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N: S — TM with the property that N, € T,M for each p € S. (Note 
the difference between this and a vector field on S, which would have the 
property that N, € T,S at each point.) A vector N, € T,M is said to 
be transverse (to S) if T,M is spanned by N, and T,S for each p € S. 
Similarly, a vector field N along S is transverse if N, is transverse at each 
pes. 

For example, any smooth vector field on M restricts to a smooth vector 
field along S; it is transverse if and only if it is nowhere tangent to S. 


Proposition 10.8. Suppose M is an oriented smooth n-manifold, S is 
an immersed hypersurface in M, and N is a smooth transverse vec- 
tor field along S. Then S has a unique orientation with the prop- 
erty that (E1,...,En-1) is an oriented basis for T,S if and only if 
(Np, £1,...,En—1) ts an oriented basis for T,M. If Q is an orientation 
form for M, then (N 1Q)|g is an orientation form for S with respect to 
this orientation. 


Remark. When n = 1, since S is a 0-manifold, this proposition should be 
interpreted as follows: At each point p € S, we assign the orientation +1 to 
p if N, is an oriented basis for T,M, and —1 if Np is negatively oriented. 
With this understanding, the proof below goes through in this case without 
modification. 


Proof. Let Q be a smooth orientation form for M. Then w = (N 1Q)|g is 
a smooth (n — 1)-form on S. It will be an orientation form for S if we can 
show that it never vanishes. Given any basis (£),...,£,—1) for T,S, the 
fact that N is transverse to S implies that (Np, £1,...,£n—1) is a basis for 
T,M. The fact that 2 is nonvanishing implies that 


Wp(Fr, seey En—1) = Qy(Np, Fi, sey En—1) # 0. 


Since w,(£1,...,E,-1) > 0 if and only if Q,(Np, £1,..., En) > 0, the 
orientation determined by w is the one defined in the statement of the 
proposition. 


Example 10.9. Considering S” as a hypersurface in R"*+!, the vector field 
N = x'0/0z'* along S” is easily seen to be transverse. (In fact, this vector 
field is orthogonal to S” with respect to the Euclidean metric.) Thus it 
induces an orientation on S”. This shows that all spheres are orientable. 
(The orientation on S° given by this construction is the one that assigns 
the orientation +1 to the point +1 € S° and —-1 to —1 € S°.) 


Manifolds with Boundary 


An important application of this construction is to define a canonical ori- 
entation on the boundary of any oriented manifold with boundary. First, 
we note that an orientation of a smooth manifold with boundary can be 
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defined exactly as in the case of a smooth manifold, with “chart” replaced 
by “generalized chart” as necessary. 

One situation that arises frequently is the following. If M is a smooth n- 
manifold, a compact, embedded n-dimensional submanifold with boundary 
DCM is called a regular domain in M. An orientation on M immediately 
yields an orientation on D, for example by restricting an orientation n- 
form to D. Examples are the closed unit ball in R” and the closed upper 
hemisphere in S”, each of which inherits an orientation from its containing 
manifold. 

If M is a smooth manifold with boundary, 0M is easily seen to be an 
embedded hypersurface in MW. Recall that any point p € M is in the domain 
of a generalized chart (U, vy), which means that ¢ is a diffeomorphism from 
U onto an open subset U c H”. Since 0M is locally characterized by 
zx” = 0 in such charts, generalized charts play a role for OM analogous to 
slice charts for ordinary embedded submanifolds. 

Let p € OM. A vector N € T,M is said to be inward-pointing if N ¢ 
T,OM and for some € > 0 there exists a smooth curve segment ¥: [0,¢] > 
M such that y(0) = p and 7/(0) = N. It is said to be outward-pointing if 
—N is inward-pointing. The following lemma gives another characterization 
of inward-pointing vectors, which is usually much easier to check. 


Lemma 10.10. Suppose M is a smooth manifold with boundary and p © 
OM. A vector N € T,M is inward-pointing if and only if N has strictly 
positive x”-component in every generalized chart (x',...,2"). 


Exercise 10.7. Prove Lemma 10.10. 


A vector field along 0M (defined just as for ordinary hypersurfaces) is 
said to be inward-pointing or outward-pointing if its value at each point 
has that property. 


Lemma 10.11. [f M is any smooth manifold with boundary, there is a 
smooth outward-pointing vector field along OM. 


Proof. Cover a neighborhood of 0M by generalized coordinate charts 
{(Ua, Ga)}. In each such chart Ua, Na = —0/0x"lamavu, is a smooth 
vector field along 0M Ug, which is outward-pointing by Lemma 10.10. 
Let {wo} be a partition of unity subordinate to the cover {Ug 1 OM} of 
OM, and define a global vector field N along 0M by 


N= S > taNa. 


Clearly N is a smooth vector field along 0M. To show that it is outward- 
pointing, let (y!,...,y”) be any generalized coordinates in a neighborhood 
of p € OM. Because each N, is outward-pointing, it satisfies dy”(Nq) < 0. 
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Therefore, the y”-component of N at p satisfies 


dy" (Np) = >) ba(p)dy”(Nalp). 


This sum is strictly negative, because each term is nonpositive and at least 
one term is negative. 


Proposition 10.12 (The Induced Orientation on a Boundary). 
Let M be an oriented smooth manifold with boundary. The orientation 
on OM determined by any outward-pointing vector field along OM is 
independent of the choice of vector field. 


Remark. As a consequence of this proposition, there is a unique orientation 
on OM determined by the orientation of M. We call this orientation the 
induced orientation or the Stokes orientation on OM. (The second term is 
chosen because of the role this orientation will play in Stokes’s theorem, to 
be described later in this chapter.) 


Proof. Let Q be an orientation form for M, and let (x',...,7”) be general- 
ized coordinates for M in a neighborhood of p € M. Replacing x! by —a! 
if necessary, we may assume they are oriented coordinates, which implies 
that Q = fdr! A---A dx” (locally) for some strictly positive function f. 
Suppose JN is an outward-pointing vector field along OM. The orientation 
of OM determined by N is given by the orientation form (N JQ)|am. 

Because x” = 0 along OM, the restriction dz" |ayy is equal to zero (Prob- 
lem 5-16). Therefore, using the antiderivation property of in, 


(N 1Dlom = f So (-1) "da" (N)de"lam A+++ Adz'lom A+++ de” om 
i=l 
= (—1)""1 fdx"(N)dz"|an Asreis dz” lou. 


Because dx"(N) = N” < 0, this is a positive multiple of (—1)"dz"|au A 
--»A dz” Jom. If N is any other outward-pointing vector field, the same 
computation shows that (N 1Q)|au is a positive multiple of the same 
(n — 1)-form, and thus a positive multiple of (N 1Q)|aac. This proves that 
N and N determine the same orientation on 0M. 


Example 10.13. This proposition gives a simpler proof that S” is ori- 
entable, because it is the boundary of the closed unit ball. 


Example 10.14. Let us determine the induced orientation on OH!” when 
H” itself has the standard orientation inherited from R”. We can identify 
OH” with R” under the correspondence (z!,...,2”~1,0) @ (a',...,a"7'). 
Since the vector field —O0/0x”" is outward-pointing along OHI”, the stan- 
dard coordinate frame for R”~! is positively oriented for OH” if and only 
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if [-0/dx",0/dz',...,0/0x"~"] is the standard orientation for R”. This 
orientation satisfies 


[-0/d2",8/dx',...,8/x"-] = -[0/82",8/dx',...,8/Ax"-) 
= (-1)"(0/de!,...,0/dx"-!, a/Ox"). 


Thus the induced orientation on OH” is equal to the standard orientation 
on R"™~! when n is even, but it is opposite to the standard orientation 
when n is odd. In particular, the standard coordinates on OH” = R"~! are 
positively oriented if and only if n is even. (This fact will play an important 
role in the proof of Stokes’s theorem below.) 


Integration of Differential Forms 


In this section, we will define in an invariant way the integrals of differential 
forms over manifolds. You should be sure you are familiar with the basic 
properties of multiple integrals in R", as summarized in the Appendix. 

We begin by considering differential forms on subsets of R”. For the time 
being, let us restrict attention to the case n > 1. Let D C R” be a compact 
domain of integration, and let w be a smooth n-form on D. (Remember, 
this means that w has a smooth extension to some open set containing D.) 
Any such form can be written as 


w= fdxtr---Adz” 


for a function f € C%°(D). We define the integral of w over D to be 


f= 


This can be written more suggestively as 


[pastas nde =f fact --ao" 
D D 


In simple terms, to compute the integral of a form such as f dx'A---Adz”, 
we just “erase the wedges”! 

Somewhat more generally, let U be an open set in R”. We would like to 
define the integral of any compactly supported n-form w over U. However, 
since neither U nor suppw may be a domain of integration in general, we 
need the following lemma. 


Lemma 10.15. Suppose K C U C R”, where U is an open set and K 
is compact. Then there is a compact domain of integration D such that 
KCDCU. 
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Proof. For each p € K, there is an open ball containing p whose closure is 
contained in U. By compactness, finitely many such open balls Uj,...,Um 
cover I. Since the boundary of an open ball is a codimension-1 submani- 
fold, it has measure zero by Theorem 6.6, and so each ball is a domain of 
integration. The set D = U,U---UU, is the required domain of integra- 
tion. 


Now if U C R” is open and w is a compactly supported n-form on U, we 


define 
fo-f» 
U D 


where D is any domain of integration such that suppw C DC U. It is an 
easy matter to verify that this definition does not depend on the choice of 
D. Similarly, if V is an open subset of the upper half-space H” and w is a 
compactly supported n-form on V, we define 


| i - < 
Vv DOH” 


where D is chosen in the same way. 

It is worth remarking that it is possible to extend the definition to inte- 
grals of noncompactly supported forms, and integrals of such forms play an 
important role in many applications. However, in such cases the resulting 
multiple integrals are improper, so one must pay close attention to conver- 
gence issues. For the purposes we have in mind, the compactly supported 
case will be more than sufficient. 

The motivation for this definition is expressed in the following proposi- 
tion. 


Proposition 10.16. Let D and E be domains of integration in R”, and let 
w be an n-form on E. If G: D — E is a smooth map whose restriction to 
Int D is an orientation-preserving or orientation-reversing diffeomorphism 
onto Int E, then 


fe 


Proof. Let us use (y',...,y") to denote standard coordinates on E, and 
(z1,...,2”) to denote those on D. Suppose first that G is orientation- 
preserving. Writing w = f dy! A--. Ady”, the change of variables formula 


| G*w if G is orientation-preserving, 
D 


-| G*w if G is orientation-reversing. 
D 
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together with the formula of Lemma 9.11 for pullbacks of n-forms yield 


If G is orientation-reversing, the same computation holds except that a 
negative sign is introduced when the absolute value signs are removed. 


Corollary 10.17. Suppose U,V are open subsets of R", G: U — V is 
an orientation-preserving diffeomorphism, and w is a compactly supported 


n-form on V. Then 
fe-f Gru. 
Vv U 


Proof. Let E Cc V be a domain of integration containing suppw. Since 
smooth maps take sets of measure zero to sets of measure zero, D = 
G-'(E) Cc U is a domain of integration containing supp G*w. Therefore, 
the result follows from the preceding proposition. 


Using this result, it is easy to make invariant sense of the integral of a 
differential form over an oriented manifold. Let M be a smooth, oriented 
n-manifold and w an n-form on M. Suppose first that w is compactly 
supported in the domain of a single oriented coordinate chart (U,y). We 
define the integral of w over M to be 


[e= i: eve 


Since (y~+)*w is a compactly supported n-form on the open subset y(U) C 
R”, its integral is defined as discussed above. 


Proposition 10.18. With w as above, 1 w does not depend on the choice 
of oriented coordinate chart whose domain contains supp w. 


Proof. Suppose (U,6) is another oriented chart such that suppw C U. 
Because Go y~ is an orientation-preserving diffeomorphism from y(U NU) 
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to G(U NU), Corollary 10.17 implies that 
[ @ye=f oye 
PU) e(UNU) 
=f Gow y@ rw 
(07) 


unv) 
= i} @Yy@oeye 
y(UNU) 


- y, ioe 


Thus the two definitions of [,, w agree. 


If M is an oriented smooth n-manifold with boundary, and w is an n-form 
on M that is compactly supported in the domain of a generalized chart, 
the definition of [ ww and the statement and proof of Proposition 10.18 
go through unchanged, provided we interpret all the coordinate charts as 
generalized charts, and compute the integrals over open subsets of H” in 
the way we described above. 

To integrate over an entire manifold, we simply apply this same definition 
together with a partition of unity. Suppose MW is an oriented smooth n- 
manifold (possibly with boundary) and w is a compactly supported n-form 
on M. Let {(Ui,y:)} be a finite cover of suppw by oriented coordinate 
charts (generalized charts if M has a nonempty boundary), and let {w;} 
be a subordinate partition of unity. We define the integral of w over M to 


be 
[- = »; i: Ww. (10.4) 


Since for each i, the n-form w,w is compactly supported in U;, each of 
the terms in this sum is well-defined according to our discussion above. To 
show that the integral is well-defined, therefore, we need only examine the 
dependence on the charts and the partition of unity. 


Lemma 10.19. The definition of [,,w given above does not depend on the 
choice of oriented charts or partition of unity. 


Proof. Suppose {(U;,,)} is another finite collection of oriented charts 
whose domains cover suppw, and {w,;} is a subordinate partition of unity. 
For each i, we compute 


[vem f (Svi)v0 
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Summing over 2, we obtain 
Df w= Tf Fj. 
4 7M ig 0M 
Observe that each term in this last sum is the integral of a form compactly 
supported in a single (generalized) chart (U;, for example), so by Proposi- 


tion 10.18 each term is well-defined, regardless of which coordinate map we 
use to compute it. The same argument, starting with [ ue Yj, shows that 


ae ui fi S He 


Thus both definitions yield the same value for [ Wace 


As usual, we have a special definition in the 0-dimensional case. The 
integral of a compactly supported 0-form (i.e., a function) f over an oriented 
0-manifold M is defined to be the sum 


[f S> +F(0), 


pEeM 


where we take the positive sign at points where the orientation is positive 
and the negative sign at points where it is negative. The assumption that 
f is compactly supported implies that there are only finitely many nonzero 
terms in this sum. 

If N C M is an oriented immersed k-dimensional submanifold (with or 
without boundary), and w is a k-form on M, we interpret [ yw to mean 
Jy (wl). In particular, if M is a compact oriented manifold with boundary, 
then OM is a compact embedded (n — 1)-manifold (without boundary). 
Thus if w is an (n — 1)-form on M, we can interpret am Y unambiguously 
as the integral of w|gnq over OM, where 0M is always understood to have 
the induced orientation. 


Proposition 10.20 (Properties of Integrals of Forms). Suppose M 
and N are oriented smooth n-manifolds with or without boundaries, and 
w,7 are compactly supported n-forms on M. 


(a) Linearity: If a,b € R, then 
i aw + by =a f v+of 1. 
M M M 


(b) ORIENTATION REVERSAL: If M denotes M with the opposite orien- 


tation, then 
fon-fw 
M M 
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(c) Positivity: If w is an orientation form for M, then J,,w > 0. 


(d) DIFFEOMORPHISM INVARIANCE: If F: N — M is an orientation- 
preserving diffeomorphism, then [,,w = Jy F*w. 


Proof. Parts (a) and (b) are left as an exercise. Suppose that w is an orien- 
tation form for M. This means that for any oriented chart (U, y), (y*)~tw 
is a positive function times dz! /---Adx”. Thus each term in the sum (10.4) 
defining f[ yy is nonnegative, and at least one term is strictly positive, thus 
proving (c). 

To prove (d), it suffices to assume that w is compactly supported in a 
single coordinate chart, because any n-form on M can be written as a finite 
sum of such forms by means of a partition of unity. Thus suppose (U, vy) 
is an oriented coordinate chart on M whose domain contains the support 
of w. It is easy to check that (F~1(U), yo F) is an oriented coordinate 
chart on N whose domain contains the support of F'*w, and the result then 
follows immediately from Corollary 10.17. 


Exercise 10.8. Prove parts (a) and (b) of the preceding proposition. 


Although the definition of the integral of a form based on partitions 
of unity is very convenient for theoretical purposes, it is useless for doing 
actual computations. It is generally quite difficult to write down a partition 
of unity explicitly, and even when one can be written down, one would 
have to be exceptionally lucky to be able to compute the resulting integrals 
(think of trying to integrate e~!/*). For computational purposes, it is much 
more convenient to “chop up” the manifold into a finite number of pieces 
whose boundaries are sets of measure zero, and compute the integral on 
each one separately. One way to do this is described below. 

A subset D C M is called a domain of integration if D is compact and 0D 
has measure zero (in the sense described in Chapter 6). For example, any 
regular domain (i.e., compact embedded n-submanifold with boundary) in 
an n-manifold is a domain of integration. 


Proposition 10.21. Let M be a compact, oriented, smooth n-manifold 
with or without boundary, and let w be ann-form on M. Suppose Fy,..., Ex 
are compact domains of integration in M; D,,..., Dx are compact domains 
of integration in R”; and fori =1,...,k, Fj: Dj — M are smooth maps 
satisfying 


(4) F,(D;) = E;, and Film p, is an orientation-preserving diffeomor- 
phism from Int D; onto Int E;. 


(ii) M = E,U---UE,. 


(tt) For eachi# Jj, E; and E; intersect only on their boundaries. 
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fend fe Pre. 


Proof. As in the preceding proof, it suffices to assume that w is compactly 
supported in the domain of a single oriented chart (U, vy). In fact, by starting 
with a cover of M by sufficiently nice charts, we may assume that OU has 
measure zero, and that y extends to a diffeomorphism from U to a compact 
domain of integration kK C H”. 

For each 3, let 


Then 


A; =UNE; CM. 


Then A; is a compact subset of M whose boundary has measure zero, since 
OA; C OU U F;(OD;). Define compact subsets B;,C; C R” by 


B= F;* (Aa), 


Since smooth maps take sets of measure zero to sets of measure zero, both 
B;, and C; are domains of integration, and yo F; maps B; to C; and re- 
stricts to a diffeomorphism from Int B; to Int C;. Therefore Proposition 


10.16 implies that 
[eres fi rte. 
Summing over 7, and noting that the interiors of the various sets A; are 
disjoint, we obtain 
fe=[ore 
M K 
=> f ye 
4 Gi 
=O f me 
4 Bi 
=f Ft. 
4 UD: 


Example 10.22. Let us use this technique to compute the integral of a 
2-form over S*, oriented by means of the outward-pointing vector field 
N=20/0xa+ yO/0y + 20/0z. Let w be the following 2-form: 


w=axdy\dz+ydzNdx+ zdz dy. 
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If we let D be the rectangle [0,7] x [0,27] and F: D — S? be the spherical 
coordinate map 


F(y,0) = (sin pcos 6, sin ysin 6, cos y), 


then the single map F: D — S? satisfies the hypotheses of Proposition 
10.21, provided that it is orientation-preserving. Assuming this for the mo- 
ment, we note that 


F*dx = cos pcos 6 dy — sin ysin 0 dé, 
F*dy = cos ysin 0 dy + sin ycos 6 dé, 
F*dz = —-singdy. 


Therefore, 


[en [ re 
S2 D 
-| —sin® cos? 6d A dy + sin? ysin? 6 dy A dé 
D 


+ cos” ysin y cos” 6 dy A dé — cos” ysiny sin? 6 dé A dy 


a sin yp dy A dé 
D 


27 wT 
= | | sin y dy dé 
0 0 


=A4r. 


To check that F is orientation-preserving, we need to show that 
(F.0/09, F.0/00) is an oriented basis for S? at each point, which means by 
definition that (N, F.0/0y, F.0/00) is an oriented basis for R*. Calculating 
at an arbitrary point (x,y,z) = F'(y, 9), we find 


) ) 3) 
N= sin y cos 0 Be a + cos p23 


F, = cos ise + cos fy cern sy 
"Op % Ox - Oy P Bz! 


0 
F,— = —sin ysin@— + sin pcosd—. 
80 ene Pe” Oy 
The transition matrix is therefore 


sinycos@ sinysin@ cosy 
cosycos@ cosysin@é —siny], 
—sinysin@ sinycosé 0 


which has determinant siny > 0. 
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Stokes’s Theorem 


In this section we will state and prove the central result in the theory of 
integration on manifolds: Stokes’s theorem for manifolds. This is a far- 
reaching generalization of the fundamental theorem of calculus and of the 
classical theorems of vector calculus. 


Theorem 10.23 (Stokes’s Theorem). Let M be an oriented n- 
dimensional manifold with boundary, and let w be a compactly supported 


(n —1)-form on M. Then 
i. dw -| Ww. (10.5) 
M aM 


The statement of this theorem is concise and elegant, but it requires a 
bit of interpretation. First, as usual, OM is understood to have the induced 
(Stokes) orientation, and w is understood to be restricted to 0M on the 
right-hand side. If OM = ©, then the right-hand side is to be interpreted 
as zero. When M is 1-dimensional, the right-hand integral is really just a 
finite sum. 

With these understandings, we proceed with the proof of the theorem. 
You should check as you read through the proof that it works correctly 
when n= 1. 


Proof. We begin by considering a very special case: Suppose M is the upper 
half space H” itself. Then the fact that w has compact support means that 
there is a number R > 0 such that suppw is contained in the rectangle 
A= [—R, R|x---x[—R, R] x [0, R]. We can write w in standard coordinates 
as 


n 
w= So wide A---Adxid--- Ada", 
i=l 
where the hat means that dx* is omitted. Therefore, 


n 
dy = S~ du, Ada’ \-++ N dat \-+- Ada” 
i=l 


n 
Ow; . ; a 
= 7 7 da? A dat N+ N dai N+» Ada” 
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Thus we compute 


[w= doe OOS epg 


A Ox? 


=Senet fff" Beye an 


We can rearrange the order of integration in each term so as to do the 
zx’ integration first. By the fundamental theorem of calculus, the terms for 
which i # n reduce to 


Bar ff fl Bea 


i=l 
n—1 R pR R 
Ow; 

= a 1) if . = (x)dx' da’ ++. dxt---dax” 
i=l 0 J-R _—R OX 
n—1 R pR R v'=R se 

=) (-1)"? | | w(x) dx! --- dat ++. dx” 
i=1 0 —R —R zt=—R 

= 0, 

because we have chosen R large enough that w = 0 when 2? = +R. The 


only term that might not be zero is the one for which 7 = n. For that term 
we have 


Ow 
op yr- 1 n ap 1 
f ta = | i Lh ee dex” da! 
ey cof wij (a) 
LR es) gr= 
R R 
=u fo fF ailaty....2" 4,0) dat da, 
—R —R 


because w,, = 0 when x” = R. 
To compare this to the other side of (10.5), we compute as follows: 


| w= >| wi(at,..., 27-1, 0) da A---AdxtA+--Adz”. 
oH” 7 2 ANdH” 


"ae! secant (10.6) 


Because x” vanishes on OH", the restriction of dx” to the boundary is 
identically zero. Thus the only term above that is nonzero is the one for 
which i = n, which becomes 


: v= f Walt) .cape? 0) da Ao de. 
OH” ANOH” 
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Taking into account the fact that the coordinates (a1,...,2"~+) are posi- 


tively oriented for OH” when n is even and negatively oriented when 7 is 
odd (Example 10.14), this becomes 


R R 
| w= uy f | GA oga iO dat esa, 
oH” Lhe J oR 


which is equal to (10.6). 

Next let M be an arbitrary manifold with boundary, but consider an 
(n —1)-form w that is compactly supported in the domain of a single (gen- 
eralized) chart (U,y). Assuming without loss of generality that y is an 
oriented chart, the definition yields 


[w= f wryae= fae), 


since (y~+)*dw is compactly supported on H”. By the computation above, 
this is equal to 


(p*)*w, (10.7) 
oH” 


where OH” is given the induced orientation. Since y, takes outward- 
pointing vectors on OM to outward-pointing vectors on H” (by Lemma 
10.10), it follows that y|unam is an orientation-preserving diffeomorphism 
onto y(U)NdH”, and thus (10.7) is equal to f,,, w. This proves the theorem 
in this case. 

Finally, let w be an arbitrary compactly supported (n—1)-form. Choosing 
a cover of suppw by finitely many oriented (generalized) coordinate charts 
{(Ui, yi)}, and choosing a subordinate partition of unity {a;}, we can apply 
the preceding argument to 7,;w for each 2 and obtain 


[en Prk 
=¥ fae 
= fa no wo 
-[E9)o+ (Ee) 
=0+f do, 


because }>; ); = 1. 
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Two special cases of Stokes’s theorem are worthy of special note. The 
proofs are immediate. 


Corollary 10.24. Suppose M is a compact manifold without boundary. 
Then the integral of every exact form over M is zero: 


i) dw=0 iffOM=@. 
M 


Corollary 10.25. Suppose M is a compact smooth manifold with bound- 
ary. If w is a closed form on M, then the integral of w over OM is zero: 


| w=0 ifdw=0o0nM. 
OM 


Example 10.26. Let N be a smooth manifold and suppose 7: [a,b] + N 
is an embedding, so that M = y[a, }] is an embedded 1-submanifold with 
boundary in N. If we give M the orientation such that y is orientation- 
preserving, then for any smooth function f € C'™(N), Stokes’s theorem 
says 


/ Soe | df = f(r) — Fr). 
[a,b] M 


Thus Stokes’s theorem reduces to the fundamental theorem for line in- 
tegrals (Theorem 4.20) in this case. In particular, when y: [a,b] > R is 
the inclusion map, then Stokes’s theorem is just the ordinary fundamental 
theorem of calculus. 


Another application of Stokes’s theorem is to prove the classical result 
known as Green’s theorem. 


Theorem 10.27 (Green’s Theorem). Suppose D is a regular domain 
in R?, and P,Q are smooth real-valued functions on D. Then 


| (32 - =) dz dy = Pdz+Q dy. 
p \ Ox Oy aD 


Proof. This is just Stokes’s theorem applied to the 1-form Pdz+Q dy. 


We will see other applications of Stokes’s theorem later in this chapter. 


Manifolds with Corners 


In many applications of Stokes’s theorem, it is necessary to deal with geo- 
metric objects such as triangles, squares, or cubes that are topological 
manifolds with boundary, but are not smooth manifolds with boundary 
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because they have “corners.” It is easy to generalize Stokes’s theorem to 
this situation, and we do so in this section. We will use this generalization 
only in our discussion of de Rham cohomology in Chapter 11. 

Let R” denote the closed positive “quadrant” of R”: 


Re arene) ER ia Ss Osanja” Soy 
This space is the model for the type of corners we will be concerned with. 


Exercise 10.9. Prove that Rv is homeomorphic to the upper half-space 
H”. 


Suppose M is a topological n-manifold with boundary. A chart with 
corners for M is a pair (U,y), where U is an open subset of M, and y isa 
homeomorphism from U to a (relatively) open set U Cc R?. Two charts with 
corners (U,y), (V,w) are said to be smoothly compatible if the composite 
map pow t: W(UNV) — g(UNV) is smooth. (As usual, this means that 
it admits a smooth extension to an open set in R”.) 

A smooth structure with corners on a topological manifold with boundary 
is a maximal collection of smoothly compatible charts with corners whose 
domains cover M. A topological manifold with boundary together with a 
smooth structure with corners is called a smooth manifold with corners. 

If M is a smooth manifold with corners, any chart with corners (U, vy) 
in the given smooth structure with corners is called a smooth chart with 
corners for M. 


Example 10.28. Any closed rectangle in R” is a smooth n-manifold with 
corners. 


Because of the result of Exercise 10.9, charts with corners are indistin- 
guishable topologically from generalized charts in the sense of topological 
manifolds with boundary. Thus from the topological view there is no dif- 
ference between manifolds with boundary and manifolds with corners. The 
difference is in the smooth structure, because the compatibility condition 
for charts with corners is different from that for generalized charts. 

It is easy to check that the boundary of Rv in R” is the set of points at 
which at least one coordinate vanishes. The points in Ry at which more 
than one coordinate vanishes are called its corner points. 


Lemma 10.29. Let M be a smooth n-manifold with corners, and let p © 
M. If p(p) is a corner point for some smooth chart with corners (U,¢), 
then the same is true for every such chart whose domain contains p. 


Proof. Suppose (U,y) and (V,~) are two charts with corners such that 
y(p) is a corner point but w(p) is not. To simplify notation, let us assume 
without loss of generality that y(p) has coordinates (x!,...,a*,0,...,0) 
with k < n— 2. Then 7(V) contains an open subset of some (n — 1)- 
dimensional linear subspace S C R”, with w(p) € S. (If w(p) is a boundary 
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point, S can be taken to be the unique subspace of the form 2* = 0 that 
contains ~(p). If w(p) is an interior point, any (n—1)-dimensional subspace 
containing ~)(p) will do.) 

Let a: SNW(V) > R” be the restriction of pow! to SNY(V). Because 
pow! is a diffeomorphism, a is a smooth immersion. Let T = a,$ C R”. 
Because T is (n — 1)-dimensional, it must contain a vector X such that one 
of the last two components X"~! or X” is nonzero (otherwise T would be 
contained in a codimension-2 subspace). Renumbering the coordinates and 
replacing X by —X if necessary, we may assume that X” < 0. 

Now let y: (—e,¢€) — S be a smooth curve such that 7(0) = p and 
axy'(0) = X. Then a(7(t)) has negative x” coordinate for small t > 0, 
which contradicts the fact that a takes its values in R?. 


If M isa smooth manifold with corners, a point p € M is called a corner 
point if y(p) is a corner point in R, with respect to some (and hence every) 
smooth chart with corners (U,y). It is clear that every smooth manifold 
with or without boundary is also a smooth manifold with corners (but with 
no corner points). Conversely, a smooth manifold with corners is a smooth 
manifold with boundary if and only if it has no corner points. The boundary 
of a smooth manifold with corners, however, is in general not a smooth 
manifold with corners (think of the boundary of a cube, for example). In 
fact, even the boundary of Rv itself is not a smooth manifold with corners. 
It is, however, a union of finitely many such: OR’ = H, U---U H;,, where 


Hea {(o% 0.150") € Re = 0} 


is an (n — 1)-dimensional smooth manifold with corners contained in the 
subspace defined by x? = 0. 

The usual flora and fauna of smooth manifolds—smooth maps, partitions 
of unity, tangent vectors, covectors, tensors, differential forms, orientations, 
and integrals of differential forms—can be defined on smooth manifolds 
with corners in exactly the same way as we have done for smooth manifolds 
and smooth manifolds with boundary, using smooth charts with corners in 
place of smooth charts or generalized charts. The details are left to the 
reader. 

In addition, for Stokes’s theorem we will need to integrate a differen- 
tial form over the boundary of a smooth manifold with corners. Since the 
boundary is not itself a smooth manifold with corners, this requires a spe- 
cial definition. Let M be an oriented smooth n-manifold with corners, and 
suppose w is an (n — 1)-form on 0M that is compactly supported in the 
domain of a single oriented smooth chart with corners (U,y). We define 
the integral of w over OM by 


ferLfere 
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where each H; is given the induced orientation as part of the boundary of 
the set where x’ > 0. In other words, we simply integrate w in coordinates 
over the codimension-1 portion of the boundary. Finally, if w is an arbitrary 
compactly supported (n — 1)-form on M, we define the integral of w over 
OM by piecing together with a partition of unity just as in the case of a 
manifold with boundary. 

In practice, of course, one does not evaluate such integrals by using par- 
titions of unity. Instead, one “chops up” the boundary into pieces that can 
be parametrized by compact Euclidean domains of integration, just as for 
ordinary manifolds with or without boundary. If M is a smooth manifold 
with corners, we say a subset A C OM has measure zero in OM if for every 
smooth chart with corners (U, vy), each set y(A) M H; has measure zero in 
A; for i = 1,...,n. A domain of integration in 0M is a subset EF C OM 
whose boundary has measure zero in 0M. The following proposition is an 
analogue of Proposition 10.21. 


Proposition 10.30. The statement of Proposition 10.21 is true if M is 
replaced by the boundary of a compact, oriented, smooth n-manifold with 
corners. 


Exercise 10.10. Show how the proof of Proposition 10.21 needs to be 
adapted to prove Proposition 10.30. 


Example 10.31. Let J x I = [0,1] x [0,1] be the unit square in R?, and 
suppose w is a smooth 1-form on O(J x I). Then it is not hard to check 
that the maps F;: I — I x I given by 


Fi(t) = (¢,0), 

F(t) = (1,4), 

F3(t) = (1—#,1), ee 
F(t) = (0,1-2) 


satisfy the hypotheses of Proposition 10.30. (These four curve segments in 
sequence traverse the boundary of J x I in the counterclockwise direction.) 


Therefore, 
i v= | w+ f wtf wtf Ww. (10.9) 
O(IxI) Fi F2 Fs Fy 


Exercise 10.11. Verify the claims of the preceding example. 


The next theorem is the main result of this section. 


Theorem 10.32 (Stokes’s Theorem on Manifolds with Corners). 
Let M be a smooth n-manifold with corners, and let w be a compactly 
supported (n — 1)-form on M. Then 


/ aw = f Ww. 
M OM 
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Proof. The proof is nearly identical to the proof of Stokes’s theorem proper, 
so we will just indicate where changes need to be made. By means of smooth 
charts with corners and a partition of unity just as in that proof, we may 
reduce the theorem to the case in which M = R®. In that case, calculating 
exactly as in the proof of Theorem 10.23, we obtain 


as Oi 
dw = -1)*" 1... da” 
fate Dee fon [Fone a 
in - * wi Ce ae ee n 
=a) fr is aie dx’ ---dx*---dx 
x*=0 


2 
Yc we fro Wn (x Pees 0,...,2") del... dat -.-da” 
2D 


(The factor (—1)* disappeared because the induced orientation on H; is 


(—1)' times that of the standard coordinates (x!,...,27,...,2).) This 
completes the proof. 


Here is an immediate application of this result, which we will use 
when we study de Rham cohomology in the next chapter. Recall that 
two curve segments Yo,71: [a,b] > M are said to be path homotopic if 
they are homotopic relative to their endpoints, that is, homotopic via a 
homotopy H: [a,b] x I — M such that H(a,t) = yo(a) = yi(a) and 
HA(b,t) = yo(b) = 71(b) for all t € J. 


Theorem 10.33. Suppose M is a smooth manifold, and yo, 1: [a,b] ~ M 
are path homotopic piecewise smooth curve segments. For every closed 1- 


form w on M, 
/ i 
YO Y1 


Proof. By means of an affine reparametrization, we may as well assume for 
simplicity that [a,b] = [0,1]. Assume first that yo and 7 are smooth. By 
Proposition 6.20, yo and 7 are smoothly homotopic relative to {0,1}. Let 
H:IxI-—M be such a smooth homotopy. Since w is closed, we have 


| d(H*w) = H*dw = 0. 
IxI IxI 
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On the other hand, J x I is a manifold with corners, so Stokes’s theorem 
implies 


o= | aH") = | H*w. 
IxI O(lxI) 


By Example 10.31 together with the diffeomorphism invariance of line in- 
tegrals (Exercise 4.9), therefore, 


0= | H*w 
a(Ix 1) 


= A*wt H*wt+ AH*wt H*w 
F, Fy Fs Fy 


=) wtf w+ | w+ | W, 
HoF; HoF» HoF3 HoF4 


where F\, Fo, F3, F, are defined by (10.8). The fact that H is relative to 
{0,1} means that Ho Fy) and Ho Fy are constant maps, and therefore the 
second and fourth terms above are zero. The theorem then follows from 
the facts that Ho F, = y and H o Fs is a backward reparametrization of 
V1: 

Next we consider the general case of piecewise smooth curves. We cannot 
simply apply the preceding result on each subinterval where yo and 71 
are smooth, because the restricted curves may not start and end at the 
same points. Instead, we will prove the following more general claim: Let 
40,71: 1 — M be piecewise smooth curve segments (not necessarily with 
the same endpoints), and suppose H: Ix I > M is any homotopy between 
them. Define curve segments 09,01: 1 — M by 


and let d9,01, be any smooth curve segments that are path homotopic to 
00,01 respectively. Then 


[o- fier fe- fie (10.10) 


When specialized to the case in which yo and y; are path homotopic, this 
implies the theorem, because 09 and oj are constant maps in that case. 
Since yo and 71 are piecewise smooth, there are only finitely many points 
(a1,---;@m) in (0,1) at which either yo or 7 is not smooth. We will prove 
the claim by induction on the number m of such points. When m = 0, both 
curves are smooth, and by Proposition 6.20 we may replace the given ho- 
motopy H by a smooth homotopy H. Recall from the proof of Proposition 
6.20 that the smooth homotopy H can actually be taken to be homotopic 
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to H relative to I x {O}UI x {1}. Thus for i = 0,1, the curve &;(t) = H(i, t) 
is a smooth curve segment that is path homotopic to o;. In this setting, 
(10.10) just reduces to (10.9). Note that the integrals over Gp and o; do not 
depend on which smooth curves path homotopic to og and oj are chosen, 
by the smooth case of the theorem proved above. 

Now let yo, y1 be homotopic piecewise smooth curves with m nonsmooth 
points (a1,...,@m), and suppose the claim is true for curves with fewer 
than m such points. For 7 = 0,1, let +; be the restriction of y; to [0, am], 
and let 7; be its restriction to [am, 1]. Let 0: I > M be the curve segment 


a(t) = H(am, bs 


and let o by any smooth curve segment that is path homotopic to a. Then, 
since y; and ¥;/’ have fewer than m nonsmooth points, the inductive hy- 
pothesis implies 


es (fe Le) 7 U,e-L,°) 
a ein heey) 
=[w- fw. 


This completes the proof. 


l| 


Integration on Riemannian Manifolds 


In this section, we explore how the theory of orientations and integration 
can be specialized to Riemannian manifolds. Thinking of R? and R® as 
Riemannian manifolds, this will eventually lead us to the classical theorems 
of vector calculus as consequences of Stokes’s theorem. 


The Riemannian Volume Form 


We begin with orientations. Let (M,g) be an oriented Riemannian mani- 
fold. We know from Proposition 8.16 that there is a smooth orthonormal 
frame (F1,...,E,) in a neighborhood of each point of M. By replacing 
E, by —F; if necessary, we can find an oriented orthonormal frame in a 
neighborhood of each point. 


Proposition 10.34. Suppose (M,g) is an oriented Riemannian n- 
manifold. There is a unique orientation form Q € A"(M) such that 


Q(Ei,-.-;En) =1 (10.11) 
for every p € M and every oriented orthonormal basis (E;) for T,M. 


258 10. Integration on Manifolds 


Remark. The n-form whose existence and uniqueness are guaranteed by 
this proposition is called the Riemannian volume form, or sometimes the 
Riemannian volume element. Because of the role it plays in integration on 
Riemannian manifolds, as we will see shortly, it is often denoted by dV, (or 
dAg or dsg in the 2-dimensional or 1-dimensional case, respectively). Be 
warned, however, that this notation is not meant to imply that the volume 
form is the exterior derivative of an (n — 1)-form; in fact, as we will see in 
Chapter 11, this is never the case on a compact manifold. You should just 
interpret dV, as a notational convenience. 


Proof. Suppose first that such a form 2 exists. If (F1,...,£,) is any local 
oriented orthonormal frame and (e,...,¢”) is the dual coframe, we can 
write Q = fet A---Ae” locally. The condition (10.11) then reduces to 
f =1,so0 


Q=elA--- Ae", (10.12) 


This proves that such a form is uniquely determined. 

To prove existence, we would like to define 9 in a neighborhood of each 
point by (10.12). If (Fy,...,£,) is another oriented orthonormal frame, 
with dual coframe (é1,...,é”), let 


We can write 
E,; = A’E; 


for some matrix (A?) of smooth functions. The fact that both frames are 
orthonormal means that (A/(p)) € O(n) for each p, so det(A?) = +1, and 
the fact that the two frames are consistently oriented forces the positive 
sign. We compute 


Q(E,,..., En) = det(e4(E;)) 
= det(A?) 
ak 
=Q(E,,...,En). 


Thus 2 = Q, so defining Q in a neighborhood of each point by (10.12) 
with respect to some oriented orthonormal frame yields a global n-form. 
The resulting form is clearly smooth and satisfies (10.11) for every oriented 
orthonormal basis. 


Although the expression for the Riemannian volume form with respect 
to an oriented orthonormal frame is particularly simple, it is also useful to 
have an expression for it in coordinates. 
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Lemma 10.35. Let (M,g) be an oriented Riemannian manifold. If (x*) is 
any oriented coordinate chart, then the Riemannian volume form has the 
local coordinate expression 


dV = 4/det(gi;) dai A+++ A dx, 


where gi; are the components of g in these coordinates. 


Proof. Let (x') be oriented coordinates near p € M. Then locally Q = 
f dz! A---Adx” for some positive coefficient function f. To compute f, let 
(£;) be any oriented orthonormal frame defined on a neighborhood of p, 
and let (e’) be the dual coframe. If we write the coordinate frame in terms 
of the orthonormal frame as 


then we can compute 


On the other hand, observe that 


Pe ee 
Hi \ Ogi? Oxi 


= (AP Ex, Aj Ei) 
= APA'(Ex, Ei) 


aS APA, 
k 


This last expression is the (i, j)-entry of the matrix product A’ A, where 
A = (A?). Thus 
det(g;;) = det(A’ A) = det A? det A = (det A)’, 


from which it follows that f = det A = +,/det(g;;). Since both frames 
(0/0z") and (E;) are oriented, the sign must be positive. 
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We noted earlier that real-valued functions cannot be integrated in a 
coordinate-independent way on an arbitrary manifold. However, with the 
additional structures of a Riemannian metric and an orientation, we can 
recover the notion of the integral of a function. 

Suppose (M,g) is an oriented Riemannian manifold (with or without 
boundary), and let dV, denote its Riemannian volume form. If f is a com- 
pactly supported smooth function on M, then f dV, is an n-form, so we 
can define the integral of f over M to be te f dV,. (This, of course, is the 
reason we chose the notation dV, for the Riemannian volume form.) If M 
itself is compact, we define the volume of M by 


Vol(M) = ‘) dV. 
M 


Lemma 10.36. Let (M,g) be an oriented Riemannian manifold. If f is a 
compactly supported smooth function on M and f > 0, then ii f dV, = 0, 
with equality if and only if f =0. 


Proof. Clearly Lip taVe = 0 if f is identically zero. If f > 0 and f is 
positive somewhere, then f dV, is an orientation form on the open subset 
U CM where f > 0, so the result follows from Proposition 10.20(c). 


Hypersurfaces in Riemannian Manifolds 


Let (1M, g) be an oriented Riemannian manifold, and suppose S Cc M isa 
submanifold. A vector field N along S is said to be normal to S if Np L T)S 
for each p € S. If S is a hypersurface, then any unit normal vector field 
along S' is clearly transverse to $, so it determines an orientation of S by 
Proposition 10.8. The next proposition gives a very simple formula for the 
volume form of the induced metric on S' with respect to this orientation. 


Proposition 10.37. Let (M,g) be an oriented Riemannian manifold, let 
SCM be an immersed hypersurface, and let g denote the induced metric 
on S. Suppose N is a smooth unit normal vector field along S. With respect 
to the orientation of S' determined by N, the volume form of (S,g) is given 
by 

dV5 = (N 1dV,)|s. 


Proof. By Proposition 10.8, the (n—1)-form N 1dV, is an orientation form 
for S. To prove that it is the volume form for the induced Riemannian 
metric, we need only show that it gives the value 1 whenever it is applied 
to an oriented orthonormal frame for S. Thus let (£1,...,£n—1) be such a 
frame. At each point p € S, the basis (Np, Eilp,--.,En—1|p) is orthonormal, 
and is oriented for T,,M (this is the definition of the orientation determined 
by NV). Thus 


(N 1dV,)|s(Fi,-- ., En-1) = dv,(N, E4,.. .; En-1) = 1, 
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which proves the result. 


The following lemma will be useful in our proofs of the classical theorems 
of vector analysis below. 


Lemma 10.38. With notation as in Proposition 10.87, if X is any vector 
field along S, we have 


X IdVg|5 = (X, N) dVz. (10.13) 
Proof. Define two vector fields X' and X+ along S by 


X+ = (X,N)N, 
X'=ax—-Xt. 


Then X = X++X', where X+ is normal to S and X' is tangent to it. 
Using this decomposition, 


X 1dV, =X idV,> +X" AdV5. 
Using Corollary 10.40, the first term simplifies to 
(X+ 4dV,)| 57 = (X,N)(N 4dV,)| qq = (X, N)dV 5. 


Thus (10.13) will be proved if we can show that (X' 1dV,)|;; = 0. If 
X1,...,Xn-1 are any vectors tangent to M, then 


(XV Ns oe Rag VA, © Beige ay) = 0, 


because any 7 vectors in an (n — 1)-dimensional vector space are linearly 
dependent. 


The result of Proposition 10.37 takes on particular importance in the 
case of a Riemannian manifold with boundary, because of the following 
proposition. 


Proposition 10.39. Suppose M is any Riemannian manifold with bound- 
ary. There is a unique smooth outward-pointing unit normal vector field N 
along OM. 


Proof. First we prove uniqueness. At any point p € OM, the vector space 
(T,OM)+ Cc T,M is 1-dimensional, so there are exactly two unit vectors 
at p that are normal to 0M. Since any unit normal vector N is obviously 
transverse to OM, it must have nonzero x”-component. Thus exactly one 
of the two choices of unit normal has negative x”-component, which is 
equivalent to being outward-pointing. 

To prove existence, we will show that there exists a smooth outward 
unit normal field in a neighborhood of each point. By the uniqueness result 
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above, these vector fields all agree where they overlap, so the resulting 
vector field is globally defined. 

Let p € OM. By Proposition 8.24, there exists an adapted orthonormal 
frame for M in a neighborhood U of p: This is a local frame (F4,..., En) 
such that (£1,..., Fn 1) restricts to an orthonormal frame for 0M. This 
implies that E,, is a (smooth) unit normal vector field along OM in a 
neighborhood of p. It is obviously transverse to 0M. If we assume (by 
shrinking U if necessary) that U is connected, then EF, must be either 
inward-pointing or outward-pointing on all of OM NU. Replacing FE, by 
—E,, if necessary, we obtain a smooth outward-pointing unit normal vector 
field defined near p. This completes the proof. 


The next corollary is immediate. 


Corollary 10.40. If (M,g) is an oriented Riemannian manifold with 
boundary and g is the induced Riemannian metric on OM, then the volume 
form of g is 


dV = (N 1dV,)\anmr, 


where N is the unit outward normal vector field along OM. 


Next we will show how Stokes’s theorem reduces to familiar results in 
various special cases. 


The Divergence Theorem 


Let (M,g) be an oriented Riemannian manifold. Multiplication by the Rie- 
mannian volume form defines a map *: C°@(M) — A”(M): 


*f = fdV. 


If (£;) is any local oriented orthonormal frame, then f can be recovered 
from *f locally by 


f =(«f)(h,..., En). 


Thus * is an isomorphism. 
Define the divergence operator div: J(M) > C®(M) by 


div X = *~'d(X 1dV,), 
or equivalently, 
d(X 1dV,) = (div X)dV,. 


In the special case of a domain with smooth boundary in R°, the following 
theorem is due to Gauss and is often referred to as Gauss’s theorem. 
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Theorem 10.41 (The Divergence Theorem). Let M be a compact, 
oriented Riemannian manifold with boundary. For any smooth vector field 
X on M, 


[ (ax) dV, = [& N) dV;, 


where N is the outward-pointing unit normal vector field along OM and g 
is the induced Riemannian metric on OM. 


Proof. By Stokes’s theorem, 
i: (div X) dV, = | d(X 1dV,) 
M M 


= X jdVy. 
OM 


The theorem then follows from Lemma 10.38. 


Surface Integrals 


The original theorem that bears the name of Stokes concerned “surface 
integrals” of vector fields over surfaces in R?. Using the differential-forms 
version of Stokes’s theorem, this can be generalized to surfaces in Riemann- 
ian 3-manifolds. (For reasons that will be explained later, the restriction to 
dimension 3 cannot be removed.) 

Let (M,g) be an oriented Riemannian 3-manifold, and let 6: TM — 
A?M denote the bundle map defined by 6(X) = X 1dV,. It is easily seen 
to be a bundle isomorphism by checking its values on the elements of any 
orthonormal frame. 

Define an operator curl: J(M) — T(M) by 


curl X = B-'d(X°), 
or equivalently, 
curl X 1dV, = d(X°). (10.14) 


The following commutative diagram summarizes the relationships among 
the gradient, divergence, curl, and exterior derivative operators: 


grad curl div 


C™(M) 7(M) “" s(m) “% o-(M) 


Id b B * 


c~(M) 4+ al(m) 4+ 42(m) 4+ 43(M), (10.15) 


Problem 10-22 shows that the composition of any two horizontal arrows in 
this diagram is zero. 
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Now suppose S C M is a compact, embedded, 2-dimensional submanifold 
with or without boundary in M, and N is a smooth unit normal vector 
field along S. Let dA denote the induced Riemannian volume form on S$ 
with respect to the induced metric g|g and the orientation determined by 
N, so that dA = (N JdV,)|s by Proposition 10.37. For any smooth vector 
field X defined on M, the surface integral of X over S (with respect to the 
given choice of normal field) is defined as 


i: (X, N) dA. 


The next result, in the special case in which M = R’, is the original 
theorem proved by Stokes. 


Theorem 10.42 (Stokes’s Theorem for Surface Integrals). Sup- 
pose S is a compact, embedded, 2-dimensional submanifold with boundary 
ina Riemannian 3-manifold M, and suppose N is a smooth unit normal 
vector field along S. For any smooth vector field X on M, 


[fou x,.v) da = (X,T) ds, 
Ss Os 


where ds is the Riemannian volume form and T is the unique positively 
oriented unit tangent vector field on OS (with respect to the metric and 
orientation induced from S'). 


Proof. The general version of Stokes’s theorem applied to the 1-form X° 


yields 
| d(x°)= | X?. 
S Os 


Thus the theorem will follow from the following two equations: 
d(X°)|g = (curl X, N) dA, (10.16) 
X’ lag = (X,T) ds. (10.17) 


Equation (10.16) is just the defining equation (10.14) for the curl com- 
bined with the result of Lemma 10.38. To prove (10.17), we note that 
xX las is a 1-form on a 1-manifold, and thus must be equal to f ds for some 
smooth function f on 0S. To evaluate f, we note that ds(T) = 1, and so 
the definition of X° yields 


f = fds(T) = X°(L) = (X,T). 
This proves (10.17) and thus the theorem. 


The curl operator is defined only in dimension 3 because it is only in that 
case that A?M is isomorphic to TM (via the map 3: X + X JdV,). In 
fact, it was largely the desire to generalize the curl and the classical version 
of Stokes’s theorem to higher dimensions that led to the entire theory of 
differential forms. 
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Problems 


10-1. 
10-2. 


10-3. 


10-4. 


10-5. 


10-6. 


10-7. 


10-8. 


Prove that every smooth 1-manifold is orientable. 


Suppose M is a smooth manifold that is the union of two open sub- 
sets, each of which is diffeomorphic to an open subset of R”, and 
whose intersection is connected. Show that M is orientable. Use this 
to give another proof that S” is orientable. 


Suppose 7: M — M is asmooth covering map and M is orientable. 
Show that M is also orientable. 


Suppose M is a connected, oriented smooth manifold and IT is a 
discrete group acting freely and properly on M. We say the action is 
orientation-preserving if for each y € T’, the diffeomorphism 4+ y-x 
is orientation-preserving. Show that M/T is orientable if and only if 
T is orientation-preserving. 


Let E be the total space of the Mobius bundle, which is the quotient 
of R? by the Z-action n+ (x,y) = (x +n, (—1)"y) (see Problem 7-22). 
The Mobius band is the subset M C E that is the image under the 
quotient map of the set {(a, y) € R?: |y| < 1}. Show that neither EF 
nor M is orientable. 


Suppose M is a connected nonorientable smooth manifold. Define a 
set M by 


M= II {orientations of T,M}, 
pEeM 


and let 7: M — M be the obvious map that sends an orientation 
of T,M to p. Show that M has a unique smooth manifold structure 
such that 7 is a local diffeomorphism. With this structure, show that 
M is connected and orientable, and 7 is a 2-sheeted smooth covering 
map. [The covering 7: M — M is called the orientation covering of 
M. 


Let T? = S! x S' Cc R* denote the 2-torus, defined by w? + x? = 
y? +27 = 1, with the orientation determined by its product structure 
(see Exercise 10.4). Compute J. w, where w is the following 2-form 
on R?: 


w= wydz A dz. 


For each of the following 2-forms w on R*, compute f,. 9, where S? 
is oriented by its outward unit normal. 


(a) w=adyAdz+ydzAdr+zdxA dy. 
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10-9. 


10-10. 


10-11. 


10-12. 


10-13. 
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(b) w=azdyAdz+yzdzAdx+27 dx Ady. 


Let D denote the surface of revolution in R® obtained by revolving the 
circle (a—2)?+z? = 1 around the z-axis, with its induced Riemannian 
metric, and with the orientation induced by the outward unit normal. 


(a) Compute the surface area of D. 
(b) Compute the integral over D of the function f(z, y,z) = 2z+1. 
(c) Compute the integral over D of the 2-form w of Problem 10-8(b). 


Let w be the (n — 1)-form on R” \ {0} defined by 


w= al” S\(-1) la dal A---AdziA---Adx”. (10.18) 


i=l 


(a) Show that w|sn-1 is the Riemannian volume element of S”~! 
with respect to the round metric. 


(b) Show that w is closed but not exact. 


Suppose M is an oriented Riemannian manifold, and S C M is an 
oriented hypersurface (with or without boundary). Show that there 
is a unique smooth unit normal vector field along S that determines 
the given orientation of S. 


Suppose S' is an oriented embedded 2-manifold with boundary in 
R?, and let C = OS with the induced orientation. By Problem 10- 
11, there is a unique smooth unit normal vector field N on S that 
determines the orientation. Let T’ be the oriented unit tangent vector 
field on C; let V be the unique vector field tangent to S' along C' that 
is outward-pointing; and let W be the restriction of N to C. Show 
that (Tp, Vp, W,) is an oriented basis for R? at each p € C. 


Let (M,g) be an oriented Riemannian n-manifold. With respect to 
any local coordinates (x‘), show that 


div (x'= = Tay Be (x' /aet(as)) ; 


where g;; are the components of g. Conclude that on R” with the 
Euclidean metric, 


Lt Og" 
w=1 


as 


10-14. 


10-15. 


10-16. 


10-17. 
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Show that the divergence operator on an oriented Riemannian mani- 
fold does not depend on the choice of orientation, and conclude that 
it is invariantly defined on all Riemannian manifolds. 


Let (M, g) be a compact, oriented Riemannian manifold with bound- 
ary, let g denote the induced Riemannian metric on OM, and let N 
be the outward unit normal vector field along OM. 


(a) Show that the divergence operator satisfies the following product 
rule for f Ee C@(M), X € T(M): 


div(fX) = f div X + (grad f, X). 


(b) Prove the following “integration by parts” formula: 


[ teraasxyav, = [ FX.) avy — [ (f div X) dV). 
M OM M 


Let (M,g) be an oriented Riemannian manifold. The operator 
A: C®@(M) > C®(M) defined by Au = div(gradu) is called the 
Laplace operator, and Au is called the Laplacian of u. A function 
u € C™®(M) is said to be harmonic if Au = 0. 


(a) If M is compact, prove Green’s identities: 


[ wavay, + f (gradu, gradu) dV, = [ u Nv dVj. 
M M aM 
(10.19) 


[ (uae — vAu) dV, = [ Nv—vNu)dv5, (10.20) 


where N and g are as in Problem 10-15. 


(b) If M is connected and 0M = @, show that the only harmonic 
functions on M are the constants. 


(c) If M is connected, OM 4 @, and u, v are harmonic functions on 
M whose restrictions to OM agree, show that u = v. 


Let (M,g) be a compact, connected, oriented Riemannian manifold, 
and let A be its Laplace operator. A real number 4 is called an eigen- 
value of A if there exists a smooth function wu on M, not identically 
zero, such that Au = Au. In this case, u is called an eigenfunction 
corresponding to A. 


(a) Prove that 0 is an eigenvalue of A, and that all other eigenvalues 
are strictly negative. 
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(b) If uw and v are eigenfunctions corresponding to distinct eigenval- 
ues, show that i uv dV, = 0. 


10-18. Let (M,g) be an oriented Riemannian n-manifold. This problem 
outlines an important generalization of the operator *: C°(M) — 
A”(M) defined in this chapter. 


(a) For each k = 1,...,n, show that there is a unique inner prod- 
uct on A*(T,M) with the following property: If (E;) is any 
orthonormal basis for TM and (e*) is the dual basis, then 
{e! : I is increasing} is an orthonormal basis for A*(T,,M). 

(b) For each k = 0,...,n, show that there is a unique bundle map 
*: AFM > A"-*M satisfying 

wA*7 = (w,n)dVo. 


This map is called the Hodge star operator. [Hint: First prove 
uniqueness, and then define * on elementary covectors defined 
with respect to an orthonormal basis.| 


(c) Show that *: A°(M) > A"(M) is given by *c = cdV,. 
(d) Show that * *w = (—1)*"—-)w if w is a k-form. 


10-19. Let (M,g) be a Riemannian manifold and X € J(M). 
(a) Show that 
X 1dVy = *X?. 
(b) Show that 
div X = «d+ X°. 
(c) If M is 3-dimensional, show that 
curl X = (*d.X?)*., 
10-20. Prove that P” is orientable if and only if n is odd. [Hint: suppose 7 is 
a nonvanishing n-form on P”. Let 7: S” — P” denote the universal 


covering map, and a: S” > S” the antipodal map a(x) = —x. Then 
compute a*z*n two ways.] 


10-21. If w is a symplectic form on a 2n-manifold, show that wA---Aw (the 
n-fold wedge product of w with itself) is a nonvanishing 2n-form on 
M, and thus every symplectic manifold is orientable. 


10-22. Show that curlograd = 0 and divocurl = 0 on any Riemannian 
3-manifold. 


10-23. 


10-24. 


10-25. 
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On R? with the Euclidean metric, show that 


O O O 
curl (Fz +O 5, +R) 
_ (OR AQ\ A s OP OR\ O i OQ OP\ O 
~ \ dy dz) Ox Oz Ox) Oy Ox Oy) Oz 


Show that any finite product M, x --- x Mx of smooth manifolds 
with corners is again a smooth manifold with corners. Give a coun- 
terexample to show that a finite product of smooth manifolds with 
boundary need not be a smooth manifold with boundary. 


Suppose M is a smooth manifold with corners, and let C denote the 
set of corner points of M. Show that M ~ @ is a smooth manifold 
with boundary. 
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11 
De Rham Cohomology 


In Chapter 9 we defined closed and exact forms: A differential form w is 
closed if dw = 0, and exact if it is of the form dyn. Because d? = 0, every 
exact form is closed. In this chapter, we explore the implications of the 
converse question: Is every closed form exact? The answer, in general, is 
no: In Example 4.23 we saw an example of a closed 1-form on R” \ {0} that 
was closed but not exact. In that example, the failure of exactness seemed 
to be a consequence of the “hole” in the center of the domain. For higher- 
degree forms, the answer to the question depends on subtle topological 
properties of the manifold, connected with the existence of “holes” of higher 
dimensions. Making this dependence quantitative leads to a new set of 
invariants of smooth manifolds, called the de Rham cohomology groups, 
which are the subject of this chapter. 

There are many situations in which knowledge of which closed forms are 
exact has important consequences. For example, Stokes’s theorem implies 
that if w is exact, then the integral of w over any compact submanifold with- 
out boundary is zero. Proposition 4.22 showed that a 1-form is conservative 
if and only if it is exact. 

We begin by defining the de Rham cohomology groups and proving some 
of their basic properties, including diffeomorphism invariance. Then we 
prove that they are in fact homotopy invariants, which implies in partic- 
ular that they are topological invariants. Using elementary methods, we 
compute some de Rham groups, including the zero-dimensional groups of 
all manifolds, the one-dimensional groups of simply connected manifolds, 
the top-dimensional groups of compact manifolds, and all of the de Rham 
groups of star-shaped open subsets of R”. Then we prove a general theorem 
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that expresses the de Rham groups of a manifold in terms of those of its 
open subsets, called the Mayer—Vietoris theorem, and use it to compute all 
the de Rham groups of spheres. 

At the end of the chapter, we turn our attention to the de Rham theorem, 
which expresses the equivalence of the de Rham groups with another set 
of groups defined purely topologically, the singular cohomology groups. 
To set the stage, we first give a brief summary of singular homology and 
cohomology theory, and prove that singular homology can be computed by 
restricting attention only to smooth simplices. In the last section, we prove 
the de Rham theorem. 


The de Rham Cohomology Groups 


In Chapter 4, we studied the closed 1-form 


pe a a (11.1) 
w+y 
and showed that it is not exact on R? \ {0}, but it is exact on some smaller 
domains such as the right half-plane H = {(2, y) : « > 0}, where it is equal 
to d@ (see Example 4.26). 

As we will see in this chapter, this behavior is typical: closed p-forms are 
always locally exact, so the question of whether a given closed form is exact 
depends on the global shape of the domain, not on the local properties of 
the form. 

Let M be a smooth manifold. Because d: A?(M) — A?*!(M) is linear, 
its kernel and image are linear subspaces. We define 


Z?(M) = Ker[d: A?(M) — A?*!(M)] = {closed p-forms}, 
B?(M) = Im{d: A?~'(M) — A?(M)] = {exact p-forms}. 


By convention, we consider A?(M) to be the zero vector space when p < 0 
or p > n=dimM, so that that, for example, B°(M) = 0 and 2"(M) = 
A”(M). 

The fact that every exact form is closed implies that B?(M) Cc Z?(M). 
Thus it makes sense to define the pth de Rham cohomology group (or just 
de Rham group) of M to be the quotient vector space 


ZP(M) 


(It is, in particular, a group under vector addition. Perhaps “de Rham 
cohomology space” would be a more appropriate term, but because most 
other cohomology theories produce only groups it is traditional to use the 
term group in this context as well, bearing in mind that these “groups” are 
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actually real vector spaces.) For any closed form w on M, we let [w] denote 
the equivalence class of w in this quotient space, called the cohomology class 
of w. Clearly H4,(M) = 0 for p > dim M, because A?(M) = 0 in that case. 
If [w] = [w"] (that is, if w and w’ differ by an exact form), we say that w 
and w’ are cohomologous. 

The first thing we will show is that the de Rham groups are diffeomor- 
phism invariants. 
Proposition 11.1 (Induced Cohomology Maps). For any smooth 
map G: M — N, the pullback G*: A?(N) > A®(M) carries Z?(N) into 
ZP(M) and B?(N) into BP(M). It thus descends to a linear map, still de- 
noted by G*, from H?p(N) to Hip(M), called the induced cohomology map. 
It has the following properties: 


(a) If F: N > P is another smooth map, then 
(Fo G)* =G"* o F*: H3,(P) > Hip(M). 


(b) If Idyy denotes the identity map of M, then (Idys)* ts the identity 
map of Hi p(M). 
Proof. If w is closed, then 
d(G*w) = G* (dw) = 0, 
so G*w is also closed. If w = dn is exact, then 
Gtw = G*(dn) = d(C"), 


which is also exact. Therefore, G* maps 2?(N) into Z?(M) and B?(N) into 
BP(M). The induced cohomology map G*: H?,(N) > Hip(M) is defined 
in the obvious way: For a closed p-form w, let 


G*|w] = [G*ul. 
If w = w+ dn, then [G*w’] = [G*w + d(G*n)] = [G*w], so this map is 


well-defined. Properties (a) and (b) follow immediately from the analogous 
properties for the pullback map on forms. 


The next two corollaries are immediate. 


Corollary 11.2 (Functoriality). For each integer p > 0, the assignment 
M +> H!,(M), F + F* is a contravariant functor from the category of 
smooth manifolds and smooth maps to the category of real vector spaces 
and linear maps. 


Corollary 11.3 (Diffeomorphism Invariance). Diffeomorphic mani- 
folds have isomorphic de Rham cohomology groups. 
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In this section, we will present a profound generalization of Corollary 11.3, 
one surprising consequence of which will be that the de Rham cohomology 
groups are actually topological invariants. In fact, they are something much 
more: homotopy invariants. 

Recall that a continuous map F’: M — N between topological spaces is 
said to be a homotopy equivalence if there is a continuous map G: N — M 
such that FoG ~ Idy and GoF ~ Idy. Such a map G is called a homotopy 
inverse for F’. If there exists a homotopy equivalence between M and N, the 
two spaces are said to be homotopy equivalent. For example, the inclusion 
map J: S°-! G R"\ {0} is a homotopy equivalence with homotopy inverse 
r(a) = x/|a|, because rot = Idgn-1 and the identity map of R” \ {0} is 
homotopic to cor via the straight-line homotopy H (x,t) = «+ (1—t)a/|a]. 

The underlying fact that will allow us to prove the homotopy invari- 
ance of de Rham cohomology is that homotopic smooth maps induce the 
same cohomology map. To motivate the proof, suppose F,G: M — N are 
smoothly homotopic maps, and let us think about what needs to be shown. 
Given a closed p-form w on N, we need somehow to produce a (p— 1)-form 
7 on M such that 


dn = F*w — G*w. (11.2) 


One might hope to construct 7 in a systematic way, resulting in a map h 
from closed p-forms on M to (p — 1)-forms on N that satisfies 


d(hw) = F*w — G*w. (11.3) 


Instead of defining hw only when w is closed, it turns out to be far simpler 
to define for each p a map h from the space of all p-forms on N to the space 
of (p—1)-forms on M. Such maps cannot satisfy (11.3), but instead we will 
find maps that satisfy 


d(hw) + h(dw) = F*w — G*w. (11.4) 


This implies (11.3) when w is closed. 

In general, if F,G: M — N are smooth maps, a collection of linear maps 
h: A?(N) > AP-1(M) such that (11.4) is satisfied for each p is called a 
homotopy operator between F* and G*. (The term cochain homotopy is 
used more frequently in the algebraic topology literature.) The key to our 
proof of homotopy invariance will be to construct a homotopy operator 
first in the following special case. For each t € [0,1], let i4: M — M x I be 
the embedding 


iz(x) = (a, t). 


Clearly to is homotopic to 71. (The homotopy is the identity map of M x I!) 
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Lemma 11.4 (Existence of a Homotopy Operator). For — any 
smooth manifold M, there exists a homotopy operator between the maps io 
and 11 defined above. 


Proof. For each p, we need to define a linear map h: A?(MxI) — AP-!(M) 
such that 


h(dw) + d(hw) = ijw — ijw. (11.5) 


We define h by the formula 
1 
hw =i) (5) dt, 


where t is the coordinate on I. More explicitly, hw is the (p — 1)-form on 
M whose action on vectors X1,...,Xp—-1 € TM is 


1 
) 
Ci irene ‘ (5 4444) (X1,...,Xp-1) dt 
0 
1 
= | wq,t)(0/Ot, X1, sae ,Xp-1) dt. 

0 
To show that h satisfies (11.5), we consider separately the cases in which 
w= f(a,t)dt\ dr" A--»Adx?-1 and w = f(z, t) dx" A--- A dx’; since h 
is linear and every p-form on M x I can be written as a sum of such forms, 


this suffices. 
CaAsE I: w = f(x, t) dtAdx® A---Adx*»-1. In this case, because dtAdt = 0, 


1 
ath) =a((f setae) de nna) 
0 
0 
= (/ f(a, t) ar) dx \ dx A+++ A dx'?-1 
Oxi \ Jo 


1 
= (/ oF t) ar) dx? \ dx™ A+++ A dax'r-?, 
0 OxI 


On the other hand, 


h(dw) =h Olas A dt Ada \-+» Ada‘? 
OxI 
‘OF a 
= | (2,t)= 1 (dei A dt A dx” \--- A dx'-1) dt 
f Dood (2 Jay J (dx? A A daz"t A-+-A da'?-?) 


‘ of j a p41 
--(f SE (ast) dt) de! de A+++ A dx"? 
= —d(hw). 


276 11. De Rham Cohomology 


Thus the left-hand side of (11.5) is zero in this case. The right-hand side 
is zero as well, because ijdt = ijdt = 0 (since t o ig and to é, are constant 
functions). 

CasE II: w = fdr! A--- A dzx’v. Now 0/0t tw = 0, which implies that 
d(hw) = 0. On the other hand, by the fundamental theorem of calculus, 


h(dw) =h (3 


1 
2 ( i 2L x,t) rns yeti 
= (f(x,1) — f(x, 0))dx*! A--- Adz"? 


fk 3k 
= 1W — Ig, 


dt A dx™! \--.» Ada? + terms without ar) 


which proves (11.5) in this case. 


Proposition 11.5. Let F,G: M — N be homotopic smooth maps. For 
every p, the induced cohomology maps F*,G*: Hip(N) > Hip(M) are 
equal. 


Proof. By Proposition 6.20, there is a smooth homotopy H: Mx I — 
N from F to G. This means that H oig = F_and H oi, = G, where 
i9,41: M — M x I are defined as above. Let h be the composite map 
h = ho H*: AP(N) — AP-1(M): 


A?(N) 5, aP(M x I) > AP-1(M), 


where h is the homotopy operator constructed in Lemma 11.4. 
For any w € A?(N), we compute 


h(dw) + d(hw) = h(H*dw) + d(hH*w) 
= hd(H*w) + dh(H*w) 
= i H*w —ipH*w 
= (H 011)*w — (H 0 ig)*w 
= G*w — F*w. 


Thus if w is closed, 
G*|w] — F*[w] = [G*w — F* | 
[A(dw) + d(hw)] 


0, 


where the last line follows from dw = 0 and the fact that the cohomology 
class of any exact form is zero. 


The next theorem is the main result of this section. 
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Theorem 11.6 (Homotopy Invariance of de Rham Cohomology). 
If M and N_ are homotopy equivalent smooth manifolds, then 
Hi (M) = Hip(N) for each p. 


Proof. Suppose F': M — N is a homotopy equivalence, with homotopy 
inverse G: N — M. By Theorem 6.19, there are smooth maps fF’: M — N 
homotopic to F and G: N — M homotopic to G. Because homotopy is 
preserved by composition, it follows that FoG ~ FoG ~Idy and GoF' ~ 
Go F ~Idy, so F and G are homotopy inverses of each other. 

Now Proposition 11.5 shows that, on cohomology, 


Fo & = (Go B)" = du)" = Idug,an 


The same argument shows that G* o F* is also the identity, so 
F*: H4,(N) > Hip(M) is an isomorphism. 


Corollary 11.7. The de Rham cohomology groups are topological invari- 
ants: If M and N are homeomorphic smooth manifolds, then their de Rham 
groups are isomorphic. 


This result is remarkable, because the definition of the de Rham groups 
of M is intimately tied up with its smooth structure, and we had no reason 
to expect that different differentiable structures on the same topological 
manifold should give rise to the same de Rham groups. 


Computations 


The direct computation of the de Rham groups is not easy in general. 
However, in this section, we will compute them in several special cases. 
We begin with disjoint unions. 


Proposition 11.8 (Cohomology of Disjoint Unions). Let {4;} be a 
countable collection of smooth manifolds, and let M = LU; M;. For each p, 
the inclusion maps .;: M; — M induce an isomorphism from H',(M) to 
the direct product space I; HES). 


Proof. The pullback maps 13: A?(M) — A?(M;) already induce an iso- 
morphism from A?(M) to |]; A?(M;), namely 

w+ (the, we, ..) = (wl masters): 
This map is injective because any p-form whose restriction to each M; is 


zero must itself be zero, and it is surjective because giving an arbitrary 
p-form on each M; defines one on M. 
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Because of this proposition, each de Rham group of a disconnected man- 
ifold is just the direct product of the corresponding groups of its compo- 
nents. Thus we can concentrate henceforth on computing the de Rham 
groups of connected manifolds. 

Our next computation gives an explicit characterization of zero- 
dimensional cohomology. 


Proposition 11.9 (Zero-Dimensional Cohomology). If M is a con- 
nected manifold, H?,(M) is equal to the space of constant functions and is 
therefore one-dimensional. 


Proof. Because there are no (—1)-forms, B°(M) = 0. A closed 0-form is a 
smooth function f such that df = 0, and since M is connected this is true 
if and only if f is constant. Thus H9,(M) = 2°(M) = {constants}. 


Corollary 11.10 (Cohomology of Zero-Manifolds). If M is a 0- 
dimensional manifold, the dimension of H9¢p(M) is equal to the cardinality 
of M, and all other de Rham cohomology groups vanish. 


Proof. By Propositions 11.8 and 11.9, H°,(M) is isomorphic to the direct 
product of one copy of R for each component of MW, which is to say each 
point. 


Next we examine the de Rham cohomology of Euclidean space, and more 
generally of its star-shaped open subsets. (Recall that a subset V C R” is 
said to be star-shaped with respect to a point q € V if for every x € V, the 
line segment from q to « is entirely contained in V.) In Proposition 4.27, 
we showed that every closed 1-form on a star-shaped open subset of R” is 
exact. The next theorem is a generalization of that result. 


Theorem 11.11 (The Poincaré Lemma). Let U be a star-shaped open 
subset of R". Then Hip(U) =0 for p> 1. 


Proof. Suppose U C R” is star-shaped with respect to g. The key feature 
of star-shaped sets is that they are contractible, which means the identity 
map of U is homotopic to the constant map sending U to q, by the obvious 
straight-line homotopy: 


H(ax,t)=q+t(a—q). 


Thus the inclusion of {gq} into U is a homotopy equivalence. The Poincaré 
lemma then follows from the homotopy invariance of H? pn together with the 
obvious fact that H7,({q}) = 0 for p > 0 because {q} is a 0-manifold. 


Exercise 11.1. If U C R” is open and star-shaped with respect to 0 and 
w = >-'w; da’ is a closed p-form on U, show by tracing through the proof of 
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the Poincaré lemma that the (p — 1)-form 7 given explicitly by the formula 


P 1 Se a 
n= ~ Ye" (/ t?~*w7(tz) ar) xt dx't \---A data A---A dx’? 
I q=1 0 


satisfies dy = w. When w is a 1-form, show that 7 is equal to the potential 
function f defined in Proposition 4.27. 


The next two results are easy corollaries of the Poincaré lemma. 


Corollary 11.12 (Cohomology of Euclidean Space). For all p > 1, 
Hi, (R”) = 0. 


Proof. Euclidean space R” is star-shaped. 


Corollary 11.13 (Local Exactness of Closed Forms). Let M be a 
smooth manifold, and let w be a closed p-form on M, p > 1. For any 
q€M, there is a neighborhood U of q on which w is exact. 


Proof. Every q € M has a neighborhood diffeomorphic to an open ball 
in R”, which is star-shaped. The result follows from the diffeomorphism 
invariance of de Rham cohomology. 


One of the most interesting special cases is that of simply connected 
manifolds, for which we can compute the first cohomology explicitly. 


Theorem 11.14. If M is a simply connected smooth manifold, then 
H}q(M) = 0. 


Proof. Let w be a closed 1-form on M. We need to show that w is exact. By 
Theorem 4.22, this is true if and only w is conservative, that is, if and only 
if all line integrals of w depend only on endpoints. Since any two piecewise 
smooth curve segments with the same endpoints are path homotopic, the 
result follows from Theorem 10.33. 


Finally, we turn our attention to the top-dimensional cohomology of 
compact manifolds. We begin with the orientable case. Suppose M is 
an orientable compact smooth manifold. There is a natural linear map 
I: A"(M) = R given by integration over M: 


Hw)= fw 


Because the integral of any exact form is zero, J descends to a linear map, 
still denoted by the same symbol, from H?p(M) to R. (Note that every 
n-form on an n-manifold is closed.) 
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Theorem 11.15 (Top Cohomology, Orientable Case). For any 
compact, connected, orientable, smooth n-manifold M, the map 
I: Hip(M) > R is an isomorphism. Thus H'',(M) is one-dimensional, 
spanned by the cohomology class of any orientation form. 


Proof. The 0-dimensional case is an immediate consequence of Corollary 
11.10, so we may assume that n > 1. Let 2 be an orientation form for M, 
and set c= f,, 2. By Proposition 10.20(c), c > 0. Therefore, for any a € R, 
I[aw| = ac, so I: Hv'p(M) — R is surjective. To complete the proof, we 
need only show that it is injective. In other words, we have to show the 
following: If w is any n-form satisfying [. uw = 0, then w is exact. 

Let {U1,...,Um} be a finite cover of M by open sets that are diffeo- 
morphic to R”, and let My, = U, U--- UU, for k = 1,...,m. Since 
M is connected, by reordering the sets if necessary, we may assume that 
My NU p41 A @ for each k. We will prove the following claim by induction 
on k: If w is a compactly supported n-form on M;, that satisfies lige w=0, 
then there exists a compactly supported (n — 1)-form 1 on My such that 
dn = w. When k = m, this is the statement we are seeking to prove, 
because every form on a compact manifold is compactly supported. 

For k = 1, since M, = U, is diffeomorphic to R”, the claim reduces 
to a statement about compactly supported forms on R”. The proof of this 
statement is somewhat technical, so we postpone it to the end of this section 
(Lemma 11.19). Assuming this for now, we continue with the induction. 

Assume the claim is true for some k > 1, and suppose w is a compactly 
supported n-form on M41 = M,UUz41 that satisfies SMa: w = 0. Choose 
an auxiliary n-form Q € A"(Mxz41) that is compactly supported in M,N 
U;,+1 and satisfies [ Mes Q = 1. (Such a form is easily constructed by using 
a bump function in coordinates.) Let {y,w} be a partition of unity for 
My+1 subordinate to the cover {Mx, Up41}. 

Let c = f Mya1 0X: Observe that yw — cQ is compactly supported in 
My, and its integral is equal to zero by our choice of c. Therefore, by the 
induction hypothesis, there is a compactly supported (n — 1)-form @ on 
My, such that da = gw — cO. Similarly, ww + cQ is compactly supported in 
Uxz41, and its integral is 


fw +cQ) = [e —y)w+ ef. Q 


=f vf pu +c 
Mr+1 Mr+i 


= 0: 


Thus by Lemma 11.19, there exists another (n — 1)-form (3, compactly 
supported in U;.41, such that d8 = yw+cQ. Both a and 7 can be extended 
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by zero to smooth compactly supported forms on M;41. We compute 


da + 8) = (pw — cQ) + (bw + Q) = (y+ yw =u, 


which completes the inductive step. 


We now have enough information to compute the de Rham cohomology 
of the circle and the punctured plane completely. 


Corollary 11.16 (Cohomology of the Circle). The de Rham coho- 
mology groups of the circle are as follows: H¢p(S') and Hjp(S') are both 
1-dimensional, spanned by the function 1 and the cohomology class of the 
form w defined by (11.1), respectively. 


Proof. Because the restriction of w to S! never vanishes, it is an orientation 
form. 


Corollary 11.17 (Cohomology of the Punctured Plane). Let M = 
R?\ {0}. Then H9,(M) and Hjp(M) are both 1-dimensional, spanned by 
the function 1 and the cohomology class of the form w defined by (11.1), 
respectively. 


Proof. Because the inclusion map 1: S! ~ M is a homotopy equivalence, 
u*: Hip(M) — Hj,(S') is an isomorphism. The result then follows from 
the preceding corollary. 


Next we consider the nonorientable case. 


Theorem 11.18 (Top Cohomology, Nonorientable Case). Let M 
be a compact, connected, nonorientable, smooth n-manifold. Then 


Hi},(M) = 0. 


Proof. We have to show that every n-form on M is exact. By the result of 
Problem 10-6, there is an orientable manifold M and a 2-sheeted smooth 
covering map 7: M — M. Let a: M — M be the map that interchanges 
the two points in each fiber of 7. If U C M is any evenly covered open set, 
then a just interchanges the two components of t~!(U), so a is a smooth 
covering transformation; in fact, it is the unique nontrivial covering trans- 
formation of 7. Now, a@ cannot be orientation-preserving—if it were, the 
entire covering group {Idjz,a} would be orientation-preserving, and then 
M would be orientable by the result of Problem 10-4. By connectedness of 
M and the fact that a is a diffeomorphism, it follows that a is orientation- 
reversing. ~ 

Suppose w is any n-form on M, and let Q = m*w € A"(M). Then 
7 0@=7 implies 


AQ = a*n*w=(roa)*w=mw=. 
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Because qa is orientation-reversing, therefore, we conclude from Proposition 


10.16 that 
[o--fea--f 9. 
M M M 


This implies that trae = 0, so by Theorem 11.15, there exists 7 € 


A"-1(M) such that dy = Q. Let 7 = 3(7 + a*n). Using the fact that 
aoa=Idj, we compute 


and 
dij = 3(dn + do*n) 
= 5(dn + a*dn) 
= 3(2 + a*Q) 
Q. 


Now let U C M be any evenly covered open set. There are exactly 
two smooth local sections 01,02: U — M over U, which are related by 
02 = a0 0,4. Observe that 


037) = (a0 01)" = 0107 = O77. 
Therefore, we can define a global (n—1)-form 7 on M by setting y = o*7 for 


any local section o. To determine its exterior derivative, choose a smooth 
local section o in a neighborhood of any point, and compute 


dy = do*7n = o* dy = 0D = o* t*w = (nOG)*w =u, 


because 700 = Idy. 


Finally, here is the technical lemma that is needed to complete the proof 
of Theorem 11.15. It can be thought of as a refinement of the Poincaré 
lemma for compactly supported n-forms. 


Lemma 11.19. Let n > 1, and suppose w is a compactly supported n- 
form on R” such that fat w=0. Then there exists a compactly supported 
(n —1)-form n on R” such that dy = w. 


Remark. Of course, we know that w is exact by the Poincaré lemma, so the 
novelty here is the claim that we can find a compactly supported (n—1)-form 
7 such that dn = w. 


Proof. We will carry out the proof by induction on n. For n = 1, we can 
write w = f dx for some smooth compactly supported function f. Choose 
R-> 0 such that supp f C [—R, R], and define F: R > R by 


F(x) = / : f(t) dt. 
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Then clearly dF = F'(x) dx = f dx = w. When x < —R, F(x) = 0 by our 
choice of R. When x > R, the fact that f,w = 0 translates to 


R x 
0= [ foe- [foe F(z), 


so in fact supp FC [—R, R]. This completes the proof for the case n = 1. 
Now let n > 1, and suppose the lemma is true on R”. Let us consider 
R”*! as the product space Rx R”, with coordinates (y, x) = (y,x',..., x2”). 
Let Q = dx! A --- Adz”, considered as an n-form on R x R”. 
Suppose w is any compactly supported (n+ 1)-form on R x R” such that 


| w=0. 
RxR” 


w= fdyAQ 


Then w can be written 


for some compactly supported smooth function f. Choose R > 0 such that 
supp f is contained in the set {(y,z) : |y| < R and |z| < R}. 

Let 0 R — R be any bump function supported in [—R, R] and satisfying 
Je ly) dy = 1. Define smooth functions e, E, F, F:RxR” = Ras follows: 


e(y, x) = p(y); 
E(y,2) = dhe p(t) dt; 


=f se24 
a) = ‘e f(t,2) dt = F(R,2). 


These functions have the properties 


de OE OE OF . OF 


axl Aah ay” Oy 


Let © denote the n-form 1*(F 2) on R”, where 1: R” G R x R” is the 
embedding s(x) = (0, x). Its integral satisfies 


[== F(0,«) dx. -- dx” 


“fk ) dy da’ - 
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Therefore, by the inductive hypothesis, there exists a compactly supported 
(n — 1)-form o on R” such that do = &. 
Now define an n-form 7 on R x R” by 


n = (F — FE)Q—edyAn*o, 


where 7: R x R” — R” is the projection. When y < R, we have F = F= 
e = 0, so 7 vanishes there. When y > R, thene =0, H=1, and F = F, 
so 7 = 0 there also. Finally, when || is sufficiently large, then F’, F’, and 
o are all zero, so 7 = 0 there as well. Thus 7 is compactly supported. 

To show that dj = w, we compute 


dn = dF \AQ— FdEAQ— EdF AQ —deAdyAr*o+edy A d(x*o). 


We consider each of these five terms separately. The first term is 


OF OF, 
= {| —— — dr 
dF AQ (Fray + Sac!) ao 
= fdyAQ 
=, 


because dx? \Q = 0. For the second term we have 


~OE 


-FdEAQ =—F>-dyAQ=—Fedy AQ 
y 


Because OF /Oy = 0, the third term reduces to 


_pdFag = —REoe ai AQ =0. 
Ox 


The fourth term is likewise zero because deA dy = y’(y)dy Ady = 0. For the 
fifth term, we observe that 7*.* F' = F' because F' is independent of y, and 
m*u*Q = Q by direct computation, and therefore 7*ui = FQ. Therefore, 
edy A d(x*o) = edy A x* (do) 
=edyA*(X) 
=Fe dy \Q. 


Thus the second and fifth terms cancel, and we are left with dy = w. 


For some purposes, it is useful to define a generalization of the de Rham 
cohomology groups using only compactly supported forms. Let A®(M/) de- 
note the space of compactly-supported p-forms on M. The pth de Rham 
cohomology group of M with compact support is the quotient space 


Byram VAM JE) 
- Im[d: A27'(M) > A?(M)] 


c 
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Of course, when M is compact, this just reduces to ordinary de Rham co- 
homology. But for noncompact manifolds, the two groups can be different, 
as the next exercise shows. 


Exercise 11.2. Using Lemma 11.19, show that H?(R”) is 1-dimensional. 


We will not use compactly supported cohomology in this book, but it 
plays an important role in algebraic topology. 


The Mayer—Vietoris Theorem 


In this section, we prove a very general theorem that can be used to compute 
the de Rham cohomology groups of many spaces, by expressing them as 
unions of open submanifolds with simpler cohomology. 

For this purpose, we need to introduce some simple algebraic concepts. 
More details about the ideas introduced here can be found in [Lee00, Chap- 
ter 13] or in any textbook on algebraic topology. 

Let & be a commutative ring, and let A* be any sequence of R-modules 
and linear maps: 


Ap-) 4, ap 4, apt 


(In all of our applications, the ring will be either Z, in which case we are 
looking at abelian groups and homomorphisms, or R, in which case we 
have vector spaces and linear map. The terminology of modules is just a 
convenient way to combine the two cases.) 

Such a sequence is said to be a complex if the composition of any two 
successive applications of d is the zero map: 


dod=0: A? — A?*? for each p. 


It is called an exact sequence if the image of each d is equal to the kernel 
of the next: 


Im[d: A?~' — A?] = Ker[d: A? > AP*1]. 


Clearly every exact sequence is a complex, but the converse need not be 
true. If A* is a complex, then the image of each map d is contained in the 
kernel of the next, so we define the pth cohomology group of A* to be the 
quotient module 


_ Kerld: A? — APt?)] 


ae ae Im[d: A?-! — AP] © 


It can be thought of as a quantitative measure of the failure of exactness 
at A?. (In algebraic topology, a complex as we have defined it is usually 


286 11. De Rham Cohomology 


called a cochain complex, while a chain complex is defined similarly except 
that the maps go in the direction of decreasing indices: 


o 


Ap+1 Ap-1 
In that case, the term homology is used in place of cohomology.) 

If A* and B* are complexes, a cochain map from A* to B*, denoted by 
F: A* — B*, is a collection of linear maps F': A? — B? (it is easiest to 
use the same symbol for all of the maps) such that the following diagram 
commutes for each p: 


> AP d > Aptl—-+... 


F| |r 


. —— > Bp Beti—... 


The fact that F'od=doF means that any cochain map induces a linear 
map on cohomology F*: H?(A*) — H?(B*) for each p, just as in the case 
of de Rham cohomology. 

A short exact sequence of complexes consists of three complexes 
A*, B*,C*, together with cochain maps 


(oar pe Sor sG 


such that each sequence 


G 


0 — AP =, pe &, cP 0 


is exact. This means F is injective, G is surjective, and Im F = KerG. 


Lemma 11.20 (The Zigzag Lemma). Given a short exact sequence of 
complexes as above, for each p there is a linear map 


6: H?(C*) — H?*1(A*), 


called the connecting homomorphism, such that the following sequence is 
exact: 


Spa) 25 nee) Swe) Sa) 16) 


Proof. We will sketch only the main idea; you can either carry out the 
details yourself or look them up. 


The Mayer—Vietoris Theorem 287 


The hypothesis means that the following diagram commutes and has 
exact horizontal rows: 


F G 


0—~ a? ~ BP ~crP—+0 
Ea eae 

0 —> art! 2. pett —F  crt1 —~9 
Pe wpe 

0 —> arv2 FE. prv2 —G . cra + 0, 


Suppose c? € C? represents a cohomology class; this means that dc? = 0. 
Since G: BP? — C? is surjective, there is some element 6? € B? such that 
Gb? = cP. Because the diagram commutes, Gdb? = dGb? = dc? = 0, and 
therefore db? € KerG = Im F. Thus there exists a?*! € A?*! satisfying 
Fa?*+ = db. By commutativity of the diagram again, Fda?*! = dFa?t! = 
ddb” = 0. Since F is injective, this implies da?+! = 0, so a?t represents a 
cohomology class in H?*+!(A*). The connecting homomorphism 6 is defined 
by setting 5[c?] = [a?**] for any such a?*! € AP*!, that is, provided there 
exists b? € BP such that 


Gb? = cP, 
Fa?! = db’. 


A number of facts have to be verified: that the cohomology class [a?*1] 
is well-defined, independently of the choices made along the way; that the 
resulting map 6 is linear; and that the resulting sequence (11.6) is exact. 
Each of these verifications is a routine “diagram chase” like the one we 
used to define 6; the details are left as an exercise. 


Exercise 11.3. Complete (or look up) the proof of the zigzag lemma. 


The situation in which we will apply this lemma is the following. Suppose 
M is a smooth manifold, and U,V are open subsets of M such that M = 
U UV. We have the following diagram of inclusions: 


V (11.7) 


288 11. De Rham Cohomology 


which induce pullback maps on differential forms: 


o 


A?(M) AP(U NV), 
(V) 


as well as corresponding induced cohomology maps. Note that these pull- 
back maps are really just restrictions: For example, k*w = wly. We will 
consider the following sequence: 


0 AP(m) ©28", aP(u) @ AP(V) 2% apt nV) 30, (11.8) 
where 


(k* @l*)w = (k*w, lw), 


(@* — j*)(w,n) =u — j*n. (11.9) 


Because pullbacks commute with d, these maps descend to linear maps on 
the corresponding de Rham cohomology groups. 


Theorem 11.21 (Mayer—Vietoris). Let M be a smooth manifold, and 
let U,V be open subsets of M whose union is M. For each p, there is a 
linear map 6: Hip(UNV) > Het (M) such that the following sequence is 
exact: 


. 5, HP (M) © 28", APU) @ H®,(V) > HP (UNV) 


SS esi (11.10) 
Remark. The sequence (11.10) is called the Mayer—Vietoris sequence for 


the open cover {U,V}. 


Proof. The heart of the proof will be to show that the sequence (11.8) is 
exact for each p. Because pullback maps commute with the exterior deriv- 
ative, (11.8) therefore defines a short exact sequence of chain maps, and 
the Mayer-—Vietoris theorem follows immediately from the zigzag lemma. 
We begin by proving exactness at A?(M), which just means showing 
that k* @ I* is injective. Suppose that o € A?(M) satisfies (k* @ I*)o = 
(olu,o|v) = (0,0). This means that the restrictions of o to U and V are 
both zero. Since {U,V} is an open cover of M, this implies that o is zero. 
To prove exactness at A?(U) @ A?(V), first observe that 


(@— 7") 0 (k* GE )(o) =(@ —F )(olu, olv) = cunv — olunv = 9, 
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which shows that Im(k* @1*) C Ker(i* — j*). Conversely, suppose (7, 17’) € 
AP(U) 6 AP(V) and (i* — j*)(n, 7’) = 0. This means n|unv = 7'|unv, so 
there is a global smooth p-form o on M defined by 


Clearly (7, 7’) = (k* @®l*)o, so Ker(i* — 9*) C Im(k* @ I*). 

Exactness at A?(U MV) means that i* — j* is surjective. This the only 
nontrivial part of the proof, and the only part that really uses any properties 
of smooth manifolds. 

Let w € AP(UNV) be arbitrary. We need to show that there exist 
1 € AP(U) and 7’ € A?(V) such that 


w= =- 7 )(n.9/) = 09-9"! =nloav —n'lunv- 
Let {y,%} be a partition of unity subordinate to the open cover {U,V}, 
and define 7 € A?(V) by 


_ re on UNV, 1.11) 


0 on U \ supp yw. 
On the set (UNV) \ supp w where these definitions overlap, they both give 


zero, so this defines 7) as a smooth p-form on U. Similarly, define 7’ € A?(V) 
by 


(11.12) 


,_ J—-gw, on UNV, 
eal 0 on V \ supp y. 


Then we have 


nlunv —1'lunv = pw — (—gw) = (w+ y)w =u, 


which was to be proved. 


For later use, we record the following corollary to the proof, which ex- 
plicitly characterizes the connecting homomorphism 0. 


Corollary 11.22. The connecting homomorphism 6: Hip(UUNV) 
H?t*(M) is defined as follows. Given w € Z?(UNV), there are p-forms 
n € AP(U) and 7! € AP(V) such that w = nlunv — 7'\unv, and then 
d[w] = [dy], where dn is extended by zero to all of M. 


Proof. A characterization of the connecting homomorphism was given in 
the proof of the zigzag lemma. Specializing this characterization to the 
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situation of the short exact sequence (11.8), we find that 6[w] = [oc] provided 
there exist (7, 7’) € A?(U) 6 A?(V) such that 


23k ok 7 
in — jn! =u, 
(k*a,l*o) = (dn, dn’). Lt) 

Arguing just as in the proof of the Mayer—Vietoris theorem, if {y, a} 
is a partition of unity subordinate to {U,V}, then formulas (11.11) and 
(11.12) define smooth forms 7 € A?(U) and 7’ € A?(V) satisfying the first 
equation of (11.13). Let o be the form on M obtained by extending dy to 
be zero outside of UM V. Because w is closed, 


olunv = dnlunv =dWwt+7')|\unv = aq |unv; 


and the second equation of (11.13) follows easily. 


Our first application of the Mayer—Vietoris theorem will be to compute 
all of the de Rham cohomology groups of spheres. In the last section of 
this chapter, we will use the theorem again as an essential ingredient in the 
proof of the de Rham theorem. 


Theorem 11.23. For n> 1, the de Rham cohomology groups of S” are 
n~ JR itfp=0 orp=n, 
AGR(S") = 

0 f0<p<n. 


Proof. We already know H¢,(S") and H,,(S") from the preceding section. 
For good measure, we give here another proof for Hijp(S"). 

Let N and S be the north and south poles in S”, respectively, and let 
U =S"\ {S}, V =S"\ {N}. By stereographic projection, both U and V 
are diffeomorphic to R”, and thus UN V is diffeomorphic to R” \ {0}. 

Part of the Mayer—Vietoris sequence for {U,V} reads 


H®,'(U) @ H2,'(V) > B23’ (UNV) = H2,(S") > H2,(U) © H2,(V). 


Because U and V are diffeomorphic to R”, the groups on both ends are 
trivial when p > 1, which implies that H?,(S") = H?2'(UNV). Moreover, 
UNV is diffeomorphic to R” \ {0} and therefore homotopy equivalent to 
S"—!, so in the end we conclude that 


H®,(S") = Hip" (S°-') for p> 1. 


We will prove the theorem by induction on n. The case n = 1 is taken 
care of by Corollary 11.16, so suppose n > 2 and assume the theorem is 
true for S"~!. Clearly H9,,(S") © R by Proposition 11.9, and Hjp(S”) = 0 
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because S” is simply connected. For p > 1, the inductive hypothesis then 


gives 
A 0 ifp<n 
TOS aa eb (Sia 
ar(S") ar ( ) R ifp=n. 


II 


This completes the proof. 


Singular Homology and Cohomology 


The topological invariance of the de Rham groups suggests that there 
should be some purely topological way of computing them. There is in- 
deed, and the connection between the de Rham groups and topology was 
first proved by de Rham himself in the 1930s. The theorem that bears his 
name is a major landmark in the development of smooth manifold theory. 
We will give a proof in the next section. 

In the category of topological spaces, there are a number of ways of defin- 
ing cohomology groups that measure the existence of “holes” in different 
dimensions, but that have nothing to do with differential forms or smooth 
structures. In this section, we describe the most straightforward one, called 
singular cohomology. Because a complete treatment of singular cohomol- 
ogy would be far beyond the scope of this book, we can only summarize 
the basic ideas here. For more details, you can consult a standard textbook 
on algebraic topology, such as [Bre93, Mun84, Spa89]. (See also [Lee00, 
Chapter 13] for a more concise treatment.) 

Suppose vp,...,Up are any p+ 1 points in some Euclidean space R”. 
They are said to be in general position if they are not contained in any 
(p — 1)-dimensional affine subspace. A geometric p-simplez is a subset of 
R” of the form 


k 
{ Daw o<t <1and at, 
i=0 i=0 


for some (p+ 1)-tuple (vo,...,vp) in general position. The points v; are 
called the vertices of the simplex, and the geometric simplex with vertices 
U0, +++,Up is denoted by (vp,..., Up). It is a compact convex set, in fact the 
smallest convex set containing {vo,...,Up}. The standard p-simplez is the 
simplex A, = (eo, €1,---,€p) C R?, where ep = 0 and ¢; is the ith standard 
basis vector. For example, Ag = {0}, Ai = [0,1], and Ag is the triangle 
with vertices (0,0), (1,0), and (0,1). 


Exercise 11.4. Show that a geometric p-simplex is a p-dimensional 
smooth manifold with corners smoothly embedded in R”. 
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Let M be a topological space. A continuous map a: A, — M is called 
a singular p-simplerz in M. The singular chain group of M in dimension 
p, denoted by C,(M), is the free abelian group generated by all singular 
p-simplices in M. An element of this group, called a singular p-chain, is 
just a finite formal linear combination of singular p-simplices with integer 
coefficients. 

One special case that arises frequently is that in which the space M is a 
convex subset of some Euclidean space R”™. In that case, for any ordered 
(p + 1)-tuple of points (wo,...,wp) in M (not necessarily in general po- 
sition), there is a unique affine map from R? to R™ that takes e; to w; 
for i = 0,...,p. The restriction of this affine map to A, is denoted by 
a(wo,...,Wp), and is called an affine singular simplex in M. 

For each i = 0,...,p, we define the ith face map in A, to be the affine 
singular (p — 1)-simplex F;,,: Ap,-1 — Ap defined by 


Fp = a(€o,..-,i,--+5 ep): 


(As usual, the hat indicates that e; is omitted.) It maps A,_; homeomor- 
phically onto the (p—1)-dimensional boundary face of A, opposite e;. The 
boundary of a singular p-simplex 0: A, — M is the singular (p — 1)-chain 
Oo defined by 


P 


Oo = So(-1)'e 0 Fi py. 


1i=0 


This extends uniquely to a group homomorphism 0: C,(M) — C,_1(M), 
called the singular boundary operator. The basic fact about the boundary 
operator is the following lemma. 


Lemma 11.24. If c is any singular chain, then 0(0c) = 0. 
Proof. The starting point is the fact that 
Fin 0 Fj p-1 = Fp 0 Fi-1p-1 (11.14) 


when 7 > j, which can be verified by following what both compositions do 
to each of the vertices of Ap_2. Using this, the proof of the lemma is just 
a straightforward computation. 


A singular p-chain c is called a cycle if 0c = 0, and a boundary if c = Ob 
for some singular (p + 1)-chain b. Let Z,(//) denote the set of singular 
p-cycles in M, and B,(M) the set of singular p-boundaries. Because 0 is a 
homomorphism, Z,(/) and B,(M) are subgroups of C,(M), and because 
000=0, B,(M) C Z,(M). The pth singular homology group of M is the 
quotient group 
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To put it another way, the sequence of abelian groups and homomorphisms 


= Cpir(M) % Op(M) % Op-1(M) > -- 
is a complex, called the singular chain complex, and H,(1) is the homology 
of this complex. 

Any continuous map F: M — WN induces a homomorphism 
Fy: Cp(M) — C,(N) on each singular chain group, defined by Fy(c) = 
o0 0 F for any singular simplex o and extended by linearity to chains. 
An easy computation shows that Fo 0 = 00 F, so F is a chain map, 
and therefore induces a homomorphism on singular homology, denoted 
by F..: H,(M) — H,(N). It is immediate that (Go F), = G, o F, and 
(Idy)» = Idx, (a1), So singular homology defines a covariant functor from 
the category of topological spaces and continuous maps to the category of 
abelian groups and homomorphisms. In particular, homeomorphic spaces 
have isomorphic singular homology groups. 


Proposition 11.25 (Properties of Singular Homology). 


(a) For any one-point space {q}, Ho({q}) ts the infinite cyclic group gen- 
erated by the cohomology class of the unique singular 0-simplex map- 
ping Ao to q, and H,({q}) =0 for all p £0. 


(b) Let {Mj} be any collection of topological spaces, and let M = ||, Mj. 
The inclusion maps 13: Mj; — M induce an isomorphism from 
©; Hp(M;) to Hp(M). 


(c) Homotopy equivalent spaces have isomorphic singular homology 
groups. 


Sketch of Proof. In a one-point space {q}, there is exactly one singular p- 
simplex for each p, namely the constant map. The result of part (a) follows 
from an analysis of the boundary maps. Part (b) is immediate because the 
maps 4; already induce an isomorphism on the chain level: 6;C,(M;) = 
Cy(M). 

The main step in the proof of homotopy invariance is the construction 
for any space M of a linear map h: C,(M) — Cp41(M x I) satisfying 


hodt+doh=(ii)# — (to)#, (11.15) 


where 13: M — M x I is the injection .;(%) = (z,7). From this it follows 
just as in the proof of Proposition 11.5 that homotopic maps induce the 
same homology homomorphism, and then in turn that homotopy equivalent 
spaces have isomorphic singular cohomology groups. 


In addition to the properties above, singular homology satisfies the fol- 
lowing version of the Mayer—Vietoris theorem. Suppose M is a topological 


294 11. De Rham Cohomology 


space and U,V Cc M are open subsets whose union is WM. The usual diagram 
(11.7) of inclusions induces homology homomorphisms: 


H,(U) 
JN 
Hy(U NV) H,(M) 
NA 
H,(V) (11.16) 


Theorem 11.26 (Mayer—Vietoris for Singular Homology). Let M 
be a topological space and let U,V be open subsets of M whose union is M. 
For each p there is a homomorphism 0,: Hp(M) — Hp-i1(UNV) such that 
the following sequence is exact: 


ean H,(UNV) > H),(U) @ H,(V) 4 H,(M) 


Pee (av) ese nA) 
where 
alc] = (é.[e], —3.[e]), 
B([d, [e']) = kale] + b[e'], 


and O.[e] = [c] provided there exist d € H,(U) and d’ € H,(V) such that 
k,d+1,d' is homologous to e and 


(inc, —jec) = (Od, Od’). 


Sketch of Proof. The basic idea, of course, is to construct a short exact 
sequence of complexes and use the zigzag lemma. The hardest part of the 
proof is showing that any homology class [e] € H,(/) can be represented in 
the form [k,d—l,d'], where d and d’ are chains in U and V, respectively. 


Note that the maps a and @ in this Mayer—Vietoris sequence can be 
replaced by 


and the same proof goes through. We have chosen the definition given 
in the statement of the theorem because it leads to a cohomology exact 
sequence that is compatible with the Mayer—Vietoris sequence for de Rham 
cohomology; see the proof of the de Rham theorem below. 
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Singular Cohomology 


In addition to the singular homology groups, for any abelian group G one 
can define a closely related sequence of groups H?(M;G) called the sin- 
gular cohomology groups with coefficients in G. The precise definition is 
unimportant for our purposes; we will only be concerned with the spe- 
cial case G = R, in which case it can be shown that H?(M;R) is a real 
vector space that is isomorphic to the space Hom(H,(M),R) of group 
homomorphisms from H,(M]/) into R. (If you like, you can take this as 
a definition of H?(M,R).) Any continuous map F: M — N induces a 
linear map F*: H?(N;R) — H?(M;R) by (F*4)[cl = y(Fi[c]) for any 
y € H?(N;R) = Hom(H,(M),R) and any singular p-chain c. The functo- 
rial properties of F, carry over to cohomology: (Go F)* = F* o G* and 
(Idar)* = Ide): 

The following properties of the singular cohomology groups follow easily 
from the definitions and Proposition 11.25. 


Proposition 11.27 (Properties of Singular Cohomology). 


(a) For any one-point space {q}, H?({q};R) is trivial except when p = 0, 
in which case it is one-dimensional. 


(b) If {Mj} is any collection of topological spaces and M = |], Mj, then 
H?(M;R) © TI, H?(Mj:R). 


(c) Homotopy equivalent spaces have isomorphic singular homology 
groups. 


The key fact about the singular cohomology groups that we will need is 
that they too satisfy a Mayer—Vietoris theorem. 


Theorem 11.28 (Mayer—Vietoris for Singular Cohomology). 
Suppose M, U, and V satisfy the hypotheses of Theorem 11.26. For each 
p there is a homomorphism 0*: H?(UNV;R) — H?*!(M;R) such that 
the following sequence is exact: 


. 5, 7°(M:R) © 2", 9°(U;R) © H?(V;R) 25 BP(U NVR) 


*, P+ M;R) E24... (11.18) 


where the maps k* ®l* and 1i*—j* are defined as in (11.9), and O*y = yoO,, 
with 0, as in Theorem 11.26. 


Sketch of Proof. For any homomorphism F': A — B between abelian 
groups, there is a dual homomorphism F*: Hom(B,R) — Hom(A, R) given 
by F*y = yo F. Applying this to the Mayer—Vietoris sequence (11.17) for 
singular homology, we obtain the cohomology sequence (11.18). The exact- 
ness of the resulting sequence is a consequence of the fact that the functor 
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A+ Hom(A,R) is exact, meaning that it takes exact sequences to exact 
sequences. This in turn follows from the fact that R is an injective group: 
Whenever H is a subgroup of an abelian group G, every homomorphism 
from H into R extends to all of G. 


Smooth Singular Homology 


The connection between singular and de Rham cohomology will be estab- 
lished by integrating differential forms over singular chains. More precisely, 
given a singular p-simplex o in a manifold M and a p-form on M, we would 
like to pull w back by o and integrate the resulting form over A,. However, 
there is an immediate problem with this approach, because forms can only 
be pulled back by smooth maps, while singular simplices are in general only 
continuous. (Actually, since only first derivatives of the map appear in the 
formula for the pullback, it would be sufficient to consider C! maps, but 
merely continuous ones definitely will not do.) In this section we overcome 
this problem by showing that singular homology can be computed equally 
well with smooth simplices. 

If M is a smooth manifold, a smooth p-simplex in M is a smooth map 
a: A, — M. The subgroup of C,(M) generated by smooth simplices is 
denoted by Cp°(M) and called the smooth chain group in dimension p; ele- 
ments of this group, which are finite formal linear combinations of smooth 
simplices, are called smooth chains. Because the boundary of a smooth 
simplex is a smooth chain, we can define the pth smooth singular homology 
group of M to be the quotient group 


_ Ker[O: Co°(M) — Cr, (M)] 


AP OS Tal C=, (M) > C(M)] 


The inclusion map 7: C3°(M) — C,(M) obviously commutes with the 
boundary operator, and so induces a map on homology: v.: Hp°(M) — 
H,(M). 


Theorem 11.29. For any smooth manifold M, the map tx: H3°(M) > 
H,(M) induced by inclusion is an isomorphism. 


Proof. |Author’s note: This proof still needs to be written. The basic idea 
is to construct, with the help of the Whitney approximation theorem, a 
smoothing operator s: Cp(M) — Cy°(M) such that so0 =008 and sou 
is the identity on CP? (M), and a homotopy operator that shows that Lo s 
induces the identity map on H,(M).| 
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In this section we will state and prove the de Rham theorem. Before getting 
to the theorem itself, we need one more algebraic lemma. Its proof is another 
diagram chase like the proof of the zigzag lemma. 


Lemma 11.30 (The Five Lemma). Consider the following commuta- 
tive diagram of modules and linear maps: 


Ay ag a a 


Ay Ag ~ As aos Ay 


| lf |f: [1 |F: 
By Br up: Be “Be Bs ie Ba 5B 


fae 


If the horizontal rows are exact and fi, fe, fa, and fs are isomorphisms, 
then fz is also an isomorphism. 


Exercise 11.5. Prove (or look up) this lemma. 


Suppose M is a smooth manifold, w is a closed p-form on M, and o isa 
smooth p-simplex in M. We define the integral of w over a to be 


fo=f Ow. 
o Ap 


This makes sense because A, is a smooth p-submanifold with corners em- 
bedded in R?, and inherits the orientation of R?. (Or we could just consider 
A, as a domain of integration in R’.) (When p = 1, this is the same as 
the line integral of w over the smooth curve segment oc: [0,1] — M.) If 
c= ier co; is a smooth p-chain, the integral of w over c is defined as 


k 
[e= ) af WwW. 
ras =1 on 


Theorem 11.31 (Stokes’s Theorem for Chains). If c is a smooth p- 
chain in a smooth manifold M, and w is a (p—1)-form on M, then 


i. w= f dw. 
Oc c 


Proof. It suffices to prove the theorem when c is just a smooth simplex co. 
Since A, is a manifold with corners, Stokes’s theorem says 


fof td = f dow = f Ow. 
o Ap Ap dA, 


The maps {F;,,:0=1,...,p} are parametrizations of the boundary faces 
of A, satisfying the conditions of Proposition 10.30, except possibly that 
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they might not be orientation preserving. To check the orientations, note 
that F;p is the restriction to A, NOH” of the affine diffeomorphism sending 


(€0,-+-,€p) to (€0,...,€;,---,€p, ei). This is easily seen to be orientation 

preserving if and only if (e0,...,€;,...,€p,é;) is an even permutation of 

€0,---,€p), which is the case if and only if p—7 is even. Since the standard 
, »€p y 


coordinates on OHI? are positively oriented if and only if p is even, the 
upshot is that Fj, is orientation preserving for OA, if and only if 7 is even. 
Thus, by Proposition 10.30. 


Pp 
ow= (1) [ Px o*w 
fae OM [Be 

P a 

=v fo (oFisyw 
i=0 Ap-1 
P A 

=Scu fe» 
i=0 00°F; p 


By definition of the singular boundary operator, this is equal to f aq W: 


Using this theorem, we can define a natural linear map J: H?,(M) > 
H”(M;R), called the de Rham homomorphism, as follows. For any [w] € 
Hip(M) and [ce] € Hp(M) = H3°(M), we define 


J[w] [ce] = fo (11.19) 


where ¢ is any smooth p-cycle representing the homology class [c]. This is 
well-defined, because if ¢ is the boundary of a smooth (p— 1)-chain 6, then 


[- im [es 0, 
é db b 


while if w = dn is exact, then 


[- fan [- 0. 
z z a 


(Note that 0¢ = 0 and dw = 0 because they represent a homology class 
and a cohomology class, respectively.) Clearly J[w][c+c¢'] = J[w][c]+IJ[w][c’], 
and the resulting homomorphism J[w]: H,() — R depends linearly on w. 
Thus J[w] is a well-defined element of Hom(H,(M),R), which we identify 
with H?(M;R). 
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Lemma 11.32. If F: M — N is a smooth map, then the following dia- 
gram commutes: 


F* 


Hip(N) AGp(M) 
| F 
H?(N;R) => H?(MR), 


Proof. Directly from the definitions, if o is a smooth p-simplex in M and 
w is a p-form on JN, 


[re=f oF = [ (Fooyw= [ w. 
o Ap A Foo 


P 


This is equivalent to J(F*[w])[o] = Jlw](F.[o]). 


Lemma 11.33. If M is a smooth manifold and U,V are open subsets of 
M whose union is M, then the following diagram commutes: 


HS (UAV) —°+ HP A(M) 


| 
H?(U NVR) ae H?(MR), (11.20) 


where 6 and O* are the connecting homomorphisms of the Mayer-Vietoris 
sequences for de Rham and singular cohomology, respectively. 


Proof. Identifying H?(M;R) with Hom(H,(M),R) as usual, commutativ- 
ity of (11.20) reduces to the following equation for any [w] € H aia (5) nV) 
and any [e] © H,(M): 


J(d[w}) le] = (O°I[w]) [e] = Jlw] (2. [e)).- 


If o is a (p — 1)-form representing 6[w] and [c] is a p-chain representing 


O.[e], this is the same as 
[o= fo 


By the characterizations of 6 and 0, given in Corollary 11.22 and Theorem 
11.26, we can choose o = di (extended by zero to all of M), where 7 € 
A?(U) and ’ € A?(U) are forms such that w = nlunv—7'|unv; and c = Od, 
where d,d’ are smooth simplices in U and V, respectively, such that d+ d’ 
represents the same homology class as e. Then, because 0d + 0d’ = de = 0 
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and dy|uav — dn'|unv = dw = 0, we have 


Thus the diagram commutes. 


Theorem 11.34 (de Rham). For any smooth manifold M and any 
p > 0, the de Rham homomorphism J: Hip(M) > H?(M;R) is an iso- 
morphism. 


Proof. Let us say that a smooth manifold M is a de Rham manifold if the 
de Rham homomorphism J: H?,(M) — H?(M;R) is an isomorphism for 
each p. Since the de Rham homomorphism commutes with the cohomol- 
ogy maps induced by smooth maps (Lemma 11.32), any manifold that is 
diffeomorphic to a de Rham manifold is also de Rham. The theorem will 
be proved once we show that every smooth manifold is de Rham. 

If M is any smooth manifold, an open cover {U;} of M is called a de 
Rham cover if each open set U; is a de Rham manifold, and every finite 
intersection U;,M---U;, is de Rham. A de Rham cover that is also a 
basis for the topology of M is called a de Rham basis for M. 

STEP 1: If {M,} is any countable collection of de Rham manifolds, then 
their disjoint union is de Rham. By Propositions 11.8 and 11.27(c), for 
both de Rham and singular cohomology, the inclusions 1;: M; @ I] 5 M; 
induce isomorphisms between the cohomology groups of the disjoint union 
and the direct product of the cohomology groups of the manifolds M;. By 
Lemma 11.32, J respects these isomorphisms. 

STEP 2: Every convex open subset of R” is de Rham. Let U be such 
a subset. By the Poincaré lemma, H4,(U) is trivial when p #4 0. Since 
U is homotopy equivalent to a one-point space, Proposition 11.27 implies 
that the singular cohomology groups of U are also trivial for p 4 0. In the 
p = 0 case, H!,,(U) is the one-dimensional space consisting of the constant 
functions, and H°(U;IR) = Hom(Ho(U),R) is also one-dimensional because 
H(U) is spanned by any singular 0-simplex. If a: Ag > M is a 0-simplex 


The de Rham Theorem 301 


(which is smooth because any map from a 0-manifold is smooth), and f is 
the constant function equal to 1, then 


atfllel = f, otf =(foo)(0) =1. 


This shows that J: H9,(U) — H°(U;R) is not the zero map, so it is an 
isomorphism. 

STEP 3: If M has a finite de Rham cover, then M is de Rham. Suppose 
M =U,U---UU,, where the open sets U; and their finite intersections are 
de Rham. We will prove the result by induction on k. Suppose first that I 
has a de Rham cover consisting of two sets {U,V}. Putting together the 
Mayer-Vietoris sequences for de Rham and singular cohomology, we obtain 
the following commutative diagram in which the horizontal rows are exact 
and the vertical maps are all given by de Rham homomorphisms: 


Ht, (U) ® Hin’ (V) ——> Hin’ (U NV) —~> Hip(M) — 


Y 
HP-(U;R) © HP-\(V;R) —> HP-(U NVR) — HP(M3R) 


Hip) © Hip(V) — Hip(U NV) 


bf 
H?(U;R) © H?(V;R) + H?(U NV:R). 


The commutativity of the diagram is an immediate consequence of Lemmas 
11.32 and 11.33. By hypothesis the first, second, fourth, and fifth vertical 
maps are all isomorphisms, so by the five lemma the middle map is an 
isomorphism, which proves that M is de Rham. 

Now assume the claim is true for smooth manifolds admitting a de Rham 
cover with k > 2 sets, and suppose {Uj,...,Ux41} is a de Rham cover of 
M. Put U =U,U---UU, and V = Ux41. The hypothesis implies that U 
and V are de Rham, and so is UNV because it has a k-fold de Rham cover 
given by {U1 NU g41,..-,U% VU 41}. Thus M = UUV is also de Rham by 
the argument above. 

STEP 4: If M has a de Rham basis, then M is de Rham. Suppose {Uq} 
is a de Rham basis for M. Let f: M — R be a continuous proper function, 
such as the one constructed in the proof of the Whitney embedding theorem 
(Theorem 6.12). For each integer m, define subsets A, and A’, of M by 


Ae a CIS HO SE 
Al, ={qeM:m-—$< f(g <m+1+ 3}. 


For each point gq € Am, there is a basis open set containing g and contained 
in Aj. The collection of all such basis sets is an open cover of A,,. Since 
f is proper, A,;, is compact, and therefore it is covered by finitely many of 
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these basis sets. Let B,, be the union of this finite collection of sets. This 
is a finite de Rham cover of B,,, so by Step 3, By is de Rham. 
Observe that By, C A‘, so Bm intersects By, nontrivially only when 


m=m-—1,m, orm-+1. Therefore, if we define 


ese (| Bees 


m odd 


then U is de Rham by Step 1, because it is the disjoint union of the de 
Rham manifolds B,,. Similarly, 


Was U) Be 


m even 


is de Rham. Finally, M =U UV is de Rham by Step 3. 

STEP 5: Any open subset of R” is de Rham. If U C R” is such a subset, 
then U has a basis consisting of Euclidean balls. Because each ball is convex, 
it is de Rham, and because any finite intersection of balls is again convex, 
finite intersections are also de Rham. Thus U has a de Rham basis, so it is 
de Rham by Step 4. 

STEP 6: Every smooth manifold is de Rham. Any smooth manifold has a 
basis of coordinate domains. Since every coordinate domain is diffeomorphic 
to an open subset of R”, as are finite intersections of coordinate domains, 
this is a de Rham basis. The claim therefore follows from Step 4. 


This result expresses a deep connection between the topological and ana- 
lytic properties of a smooth manifold, and plays a central role in differential 
geometry. If one has some information about the topology of a manifold 
M, the de Rham theorem can be used to draw conclusions about solutions 
to differential equations such as dy = w on M. Conversely, if one can prove 
that such solutions do or do not exist, then one can draw conclusions about 
the topology. 
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Problems 


11-1. 


Let M be a smooth manifold, and w € A?(M), 7 € A%(M). Show 
that the de Rham cohomology class of w A 7 depends only on the de 
Rham cohomology classes of w and 7, and thus there is a well-defined 
bilinear map U: H?,(M) x H4,(M) — H25%(M) given by 


[4] U [n] = [w An). 


(This bilinear map is called the cup product.) 


. Let M be an orientable smooth manifold and suppose w is a closed 


p-form on M. 


(a) Show that w is exact if and only if the integral of w over every 
smooth p-cycle is zero. 


(b) Now suppose that H,(M) is generated by the homology classes 
of finitely many smooth p-cycles {ci1,...,¢m}. The numbers 
P,(w),..., Pm(w) defined by 


are called the periods of w with respect to this set of generators. 
Show that w is exact if and only if all of its periods are zero. 


. Let M be a smooth n-manifold and suppose S C M is an immersed, 


compact, oriented, p-dimensional submanifold. A smooth triangula- 
tion of S is a smooth p-cycle c = >, nio; in M with the following 
properties: 


e Each 9; is an orientation-preserving embedding of A, into S. 

e Ift Jj, then o;(Int A,) N90; (Int A,) = 2. 

e S=U; oi(Ap). 
(It can be shown that every smooth orientable submanifold admits 
a smooth triangulation, but we will not use that fact.) Two p- 
dimensional submanifolds $,S’ C M are said to be homologous if 


there exist smooth triangulations c for S' and c’ for S’ such that c—c’ 
is a boundary. 


(a) If c is a smooth triangulation of S and w is any p-form on M, 
show that fiw = fw. 


(b) If w is closed and S$, $’ are homologous, show that fw = fg, w. 
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11-4. Suppose (M,g) is a Riemannian n-manifold. A p-form w on M is 


11-7. 


11-9. 


called a calibration if w is closed and wg(X1,...,Xp) < 1 whenever 
(X1,...,X,p) are orthonormal vectors in some tangent space T,M. 
A smooth submanifold S C M is said to be calibrated if there is a 
calibration w such that w|s is the volume form for the induced Rie- 
mannian metric on S. If S C M is asmoothly triangulated calibrated 
submanifold, show that the volume of S (with respect to the induced 
Riemannian metric) is less than or equal to that of any other subman- 
ifold homologous to S. (Calibrations were invented in 1985 by Reese 
Harvey and Blaine Lawson [HL82]; they have become increasingly 
important in recent years because in many situations a calibration is 
the only known way of proving that a given submanifold is volume 
minimizing in its homology class.) 


. Let D C R® be the torus of revolution obtained by revolving the circle 


(x — 2)? + 2? = 1 around the z-axis, with the induced Riemannian 
metric. Show that the inner circle {(z,y,z) : z = 0,27 + y? = 1} 
is calibrated, and therefore has the shortest length in its homology 
class. 


. Let M be a compact, connected, orientable, smooth n-manifold, and 


let p be any point of M. Let V be a neighborhood of p diffeomorphic 
to R” and let U = M ~ {p}. 


(a) Show that the connecting homomorphism 6: Hiz'(U NV) > 
H"',(M) is an isomorphism. [Hint: Consider the (n — 1)-form w 
on UNV & R"\ {0} defined in coordinates by (10.18) (Problem 
10-10).] 


(b) Use the Mayer—Vietoris sequence of {U,V} to show that 
Hp(M ~ {p}) = 90. 


Let M be a compact, connected, smooth manifold of dimension 
n > 3. For any p © M and O < k& < n, show that the map 
H*,.(M) — H*p(M ~ {p}) induced by inclusion M \ {p} @ M 
is an isomorphism. [Hint: Use a Mayer—Vietoris sequence together 
with the result of Problem 11-6. The cases k = 1 and k = n—1 will 


require special handling. ] 


. Let M1, Mz be smooth, connected, orientable manifolds of dimension 


n > 2, and let Mi#My2 denote their smooth connected sum (see 
Problem 5-20). Show that H¥,(Mi#M2) = Hk, (M1) ¢ Hk p(M2) for 
O<k<n. 


Suppose (M,w) is a 2n-dimensional symplectic manifold. 


(a) Show that w” =wA---Aw (the n-fold wedge product of w with 
itself) is not exact. [Hint: See Problem 10-21.] 
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(b) Show that H24(M) #0 for k=1,...,n. 


(c) Show that the only sphere that admits a symplectic structure is 
S* 


306 11. De Rham Cohomology 


12 


Integral Curves and Flows 


In this chapter, we begin to explore vector fields in more depth. The primary 
objects associated with vector fields are “integral curves,” which are smooth 
curves whose tangent vector at each point is equal to the value of the 
vector field there. We will show in this chapter that a vector field on a 
manifold determines a unique integral curve through each point; the proof 
is an application of the existence and uniqueness theorem for solutions of 
ordinary differential equations. 

The collection of all integral curves of a given vector field on a manifold 
determines a family of diffeomorphisms of (open subsets of) the manifold, 
called a “flow.” Any smooth R-action is a flow, for example; but we will 
see that there are flows that are not R-actions because the diffeomorphisms 
may not be defined for all ¢t € R or all points in the manifold. 

In subsequent chapters, we will begin to study some of the profound 
applications of these ideas. 


Integral Curves 


A smooth curve y: J > M determines a tangent vector 7'(t) € Ty()M at 
each point of the curve. In this section we describe a way to work backwards: 
Given a tangent vector at each point, we seek a curve that has those tangent 
vectors. 

Let M be a smooth manifold and let V be a smooth vector field on M. 
An integral curve of V is a smooth curve y: J — M defined on an open 
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interval J C R such that 
'() =Vyay tor allt € J. 


In other words, the tangent vector to y at each point is equal to the value 
of V at that point. If 0 € J, the point p = 7(0) is called the starting point 
of y. (The reason for the term “integral curve” will be explained shortly.) 


Example 12.1 (Integral Curves). 


(a) Let V = 0/0z be the first coordinate vector field on R?. It is easy to 
check that any curve of the form y(t) = (t+ a,b) for constants a and 
b is an integral curve of V, satisfying y(0) = (a,b). Thus there is an 
integral curve passing through each point of the plane. 


(b) Let W = 20/0x + yO/dy on R?. If y: R > R? is a smooth curve, 
written in standard coordinates as y(t) = (x(t), y(t)), then the con- 
dition y(t) = W,(z) for y to be an integral curve translates to 


a) a) 
x(t) az + y'(t) ay 
© \(a(t),y(t)) Y \(a(t),y(t)) 
fa) fa) 
© | (e(t),y(t)) Y\(a(t),y(t)) 


Comparing the components of these vectors, this is equivalent to the 
pair of ordinary differential equations 


a'(t) = x(t), 
y'(t) = y(t). 
These equations have the solutions x(t) = ae’ and y(t) = be’ 


for arbitrary constants a and b, and thus each curve of the form 
7(t) = (ae’, bet) is an integral curve of W. Since (0) = (a,b), we see 
once again that there is an integral curve passing through each point 
(a,b) € R? 


As the second example above illustrates, finding integral curves boils 
down to solving a system of ordinary differential equations in a coordinate 
chart. More generally, let y: J — M be any smooth curve. Writing y in 
local coordinates as y(t) = (71(t),...,7"(t)), the condition 7/(t) = Vy) 
that y be an integral curve of a smooth vector field V can be written in 
local coordinates on an open set U as 


: ra) F 0 
(YQ a5 =V*09@) zal > 
Ox" |(4) Ox" | (4) 
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which reduces to the system of ordinary differential equations (ODEs) 


(VO =ViO'O) 9"), 


(YO =V"VO,--- 7"), 


where the component functions V* are smooth on U. The fundamental 
fact about such systems, which we will state precisely and prove later 
in the chapter, is that there is a unique solution, at least for t in a 
small time interval (—e,¢), satisfying any initial condition of the form 
(y1(0),...,7"(0)) = (a1,...,a”) for (a1,...,a") € U. (This the reason 
for the terminology “integral curves,” because solving a system of ODEs 
is often referred to as “integrating” the system.) For now, we just note 
that this implies there is a unique integral curve, at least for a short time, 
starting at any point in the manifold. Moreover, we will see that up to 
reparametrization, there is a unique integral curve passing through each 
point. 

The following simple lemma shows how an integral curve can be repara- 
metrized to change its starting point. 


Lemma 12.2 (Translation Lemma). Let V be a smooth vector field on 
a smooth manifold M, let J CR be an open interval containing 0, and let 
vy: J > M be an integral curve of V. For anya € J, let J={t ER: 
t+a€ J}. Then the curve ¥: J > M defined by F(t) = y(t + a) is an 
integral curve of V starting at y(a). 


Proof. One way to see this is as a straightforward application of the chain 
rule in local coordinates. Somewhat more invariantly, we can examine the 
action of ¥’(¢) on a smooth function f defined in a neighborhood of a point 
(to). By the chain rule and the fact that y is an integral curve, 


V(to)f = = 


(for t+ a) 


= (fo 7)'(to +4) 
= 7' (to + a) f = Vi (to+a) f = Vy(t0) f- 


Thus ¥ is an integral curve of V. 


Flows 


There is another way to visualize the family of integral curves associated 
with a vector field. Let V be a vector field on a smooth manifold M, and 
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suppose it has the property that for each point p € M there is a unique 
integral curve 0): R > M starting at p. (It may not always be the case 
that all of the integral curves are defined for all t € R, but for purposes of 
illustration let us assume for the time being that they are.) For each t € R, 
we can define a map @; from M to itself by sending each point p € M to 
the point obtained by following the curve starting at p for time t: 


O:(p) = 0 (t). 


This defines a family of maps 6,: M — M for t € R. If g = 0)(s), the 
translation lemma implies that the integral curve starting at q satisfies 
0 (t) = 9) (t + s). When we translate this into a statement about the 
maps 6;, it becomes 


0,005 (p) = O45 (p). 


Together with the equation 09(p) = 6) (0) = p, which holds by definition, 
this implies that the map 6: Rx M — M is an action of the additive group 
Ron M. 

Motivated by these considerations, we define a global flow on M (some- 
times also called a one-parameter group action) to be a smooth left action 
of R on WM; that is, asmooth map @: R x M — M satisfying the following 
properties for all s,t € R and all pe M: 


A(t, O(s,p)) = O(t + 5, p), 
0(0, p) =p. 


Given a global flow 6 on M, we define two collections of maps as follows. 


(12.1) 


e For each t € R, define 0:: M — M by 
The defining properties (12.1) are equivalent to the group laws: 


O, oe) 6, => O45, 


12.2 
09 = Idyy . ( ) 


As is the case for any smooth group action, each map 6: M — M is 
a diffeomorphism. 


For each p € M, define a smooth curve 6): R > M by 
9°) (t) = O(t, p). 


The image of this curve is just the orbit of p under the group action. 
Because any group action on a set partitions the set into disjoint 
orbits, it follows that M is the disjoint union of the images of these 
curves. 
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The next proposition shows that every global flow arises as the set of 
integral curves of some vector field. 
Proposition 12.3. Let 0: Rx M— M be a global flow. For eachp € M, 
define a tangent vector V, € T,M by 


Vp = 0)'(0) = g 


t=0 
The assignment p'> V, is a smooth vector field on M, and each curve g(P) 


is an integral curve of V. 


The vector field V defined in this proposition is called the infinitesimal 
generator of @, for reasons we will explain below. 


Proof. To show that V is smooth, it suffices to show that Vf is smooth 
for any smooth function f defined on an open subset of M. For any such 
function, just note that 


Vil) =Vef =O OF =<] sOW) =F] fot). 


t=0 t=0 


Because 6(t,p) depends smoothly on (t,p), so does f(0(t, p)) by composi- 
tion, and therefore so also does the derivative of f(0(t,p)) with respect to 
t. (You can interpret this derivative as the action of the smooth vector field 
0/Ot on the smooth function fod: Rx M — R.) If follows that V f(p) 
depends smoothly on p, so V is smooth. 

To show that 6) is an integral curve of V, we need to show that 


g)'(t) — Vow) (t) 


for all p € M and allt € R. Let to € R be arbitrary, and set gq = 0) (to) = 
61,(p), so that what we have to show is 0)'(t9) = Vy. By the group law, 
for all t, 


a (t) = 0:(q) 
= 4481, (p)) 
= 914t9 (Pp) 
= Q) (t +4 to). 
Therefore, for any smooth function f defined in a neighborhood of q, 


Vaf = 0" (0) f 
=—| 0%) 


t=0 


=F] FEM +t) 
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which was to be shown. 


Another important property of the infinitesimal generator is that it is 
invariant under the flow, in the following sense. Let V be a smooth vector 
field on a smooth manifold M, and let F: M — M be a diffeomorphism. 
We say that V is invariant under F if F.V = V. Unwinding the definition 
of the push-forward of a vector field, this means that for each p € M, 
E.Vp = Ver(p)- 


Proposition 12.4. Let 6 be a global flow on M and let V be its infinites- 
imal generator. Then V is invariant under 6; for each t € R. 


Proof. Let p € M and to € R be arbitrary, and set q = 0;,(p). We need to 
show that 


(919 )*Vp = Vq- 


Applying the left-hand side to a smooth function f defined in a neighbor- 
hood of g and using the definition of V, we obtain 


(9t0%Vp)F = Vol © Ft) 


=—| fob, od)(t) 
t=0 


= ae F(t (91 (P)) 


t=0 


=a Ff (Ato +4(P)) 


t=0 


==) £0 (to+8) 
t=0 


= 0°)" (t9) f. 


Since 6) is an integral curve of V, 0)’(to) = Vo. 


We have seen that every global flow gives rise to a smooth vector field 
whose integral curves are precisely the curves defined by the flow. Con- 
versely, we would like to be able to say that every smooth vector field is 
the infinitesimal generator of a global flow. However, it is easy to see that 
this cannot be the case, because there are vector fields whose integral curves 
are not defined for all t € R, as the following examples show. 


Example 12.5. Let M = {(z,y) € R?: x < O}, and let V = 0/dz. Rea- 
soning as in Example 12.1(a), we see that the integral curve of V starting 
at (a,b) € M is y(t) = (t + a,b). However, in this case, y is defined only 
for t < —a. 
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Example 12.6. For a somewhat more subtle example, let M be all of R? 
and let W = x70/0z. You can check easily that the unique integral curve 
of W starting at (1,0) is 


This curve is defined only for t < 1. 


For this reason, we make the following definitions. If M is a smooth 
manifold, a flow domain for M is an open subset D C R x M with the 
property that for each p € M, the set D, = {t € R: (t,p) € D} is an open 
interval containing 0. A flow on M is a smooth map 0: D — M, where 
D CRx M is a flow domain, that satisfies 


6(0,p) = p for allpe M, 
A(t, O(s,p)) = O(t + s,p) whenever s € Dy and t € Doys.p). 


We sometimes call 6 a local flow to distinguish it from a global flow as 
defined earlier. The unwieldy term local one-parameter group action is also 
commonly used. 

If 6 is a flow, we define 6;(p) = 0) (t) = 0(t, p) whenever (t,p) € D, just 
as for a local flow. Similarly, the infinitesimal generator of @ is defined by 
Vp = 9°)'(0). 


Lemma 12.7 (Properties of Flows). Let D be a flow domain for M, 
and let 0: D— M be a flow. 


(a) For eacht € R, the set M; = {p © M: (t,p) € D} is open in M, and 
0:: Mt — M is a diffeomorphism onto an open subset of M. 


(b) The following relation holds whenever the left-hand side is defined: 


O; ie) 6, = O45. 


(c) The infinitesimal generator V of 0 is a smooth vector field. 


(d) For each p€ M, 0: D, — M is an integral curve of V starting at 
D. 


(e) For eacht €R, 0:,V =V on the open set 6,(M;). 
Exercise 12.1. Prove Lemma 12.7. 


Another important property of flows is the following. 


Lemma 12.8. Suppose @ is a flow on M with infinitesimal generator V, 
andp € M. If Vp =0, then 0) is the constant curve 6) (t) = p. If V, #0, 
then 0) : Dy, — M is an immersion. 
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Proof. For simplicity, write y = 6). Let t € D, be arbitrary, and put 
q = y(t). Note that the push-forward y,: T/R— T,M is zero if and only 
if 7/(t) = 0. Part (e) of Lemma 12.7 shows that V, = 0;.Vp. Therefore 
y(t) = Vg = 0 if and only if y'/(0) = V, = 0; in other words, if 7’(t) 
vanishes for some t € D, it vanishes for all such ¢. Thus if V, = 0, then + 
is a smooth map whose push-forward at each point is zero, which implies 
that it is a constant map (because its domain is connected). On the other 
hand, if V, # 0, then +, is nonzero, hence injective, at each point, so 7 is 
an immersion. 


The Fundamental Theorem on Flows 


In this section we will see that every smooth vector field gives rise to a 
flow, which is unique if we require it to be mazimal, which means that it 
cannot be extended to any larger flow domain. 


Theorem 12.9 (Fundamental Theorem on Flows). Let V be a 
smooth vector field on a smooth manifold M. There is a unique maximal 
flow whose infinitesimal generator is V. 


The flow whose existence is asserted in this theorem is called the flow 
generated by V. 

The term “infinitesimal generator” comes from the following picture. In 
a local coordinate chart, a reasonably good approximation to the flow can 
be obtained by composing very many small affine translations, with the 
direction and length of each successive motion determined by the value 
of the vector field at the point arrived at in the previous step. Long ago, 
mathematicians thought of a flow as being composed of infinitely many 
infinitesimally small linear steps. 

As we saw earlier in this chapter, finding integral curves of V (and there- 
fore finding the flow generated by V) boils down to solving a system of 
ordinary differential equations, at least locally. Thus before beginning the 
proof, let us state a basic theorem about solutions of ordinary differential 
equations. We will give the proof of this theorem in the last section of the 
chapter. 


Theorem 12.10 (ODE Existence, Uniqueness, and Smoothness). 
Let U C R” be open, and let V: U — R” be a smooth map. For any 
(to,%0) € Rx U and any sufficiently small ¢ > 0, there exist an open set 
Uo CU containing xo and a smooth map 6: (to — €,to +e) x Up > U such 
that for each x € Up, the curve y(t) = O(t,x) is the unique solution on 
(to — €,to +) to the initial-value problem 


y(t) = Vi), 


a (to) ee 
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Using this result, we now prove the fundamental theorem on flows. 


Proof of Theorem 12.9. We begin by noting that the existence assertion 
of the ODE theorem implies that there exists an integral curve starting at 
each point p € M, because the equation for an integral curve is a system 
of ODEs in any local coordinates around p. 
Now suppose 7,7: J — M are two integral curves defined on the same 
pen interval J such that 7(to) = ¥(to) for some to € J. Let § be the set 
ft € J such that y(t) = ¥7(t). Clearly & is nonempty because to € 8 by 
ypothesis, and it is closed in J by continuity. On the other hand, suppose 
t; € 8. Then in a coordinate neighborhood around the point p = y(t1), 
y and ¥ are both solutions to same ODE with the same initial condition 
(ti) = 7(t1) = p. By the ODE theorem, there is a neighborhood (t, — 
€,t; + €) of t; on which there is a unique solution to this initial-value 
problem. Thus y = ¥ on (t1; — €,t1; +¢), which implies that S$ is open in J. 
Since J is connected, § = J, which implies that -~ = ¥ on all of J. Thus any 
two integral curves that agree at one point agree on their common domain. 

For each p € M, let D, be the union of all intervals J C R containing 0 
on which an integral curve starting at p is defined. Define 6): D, — M 
by letting 9) (t) = 7(t), where ¥ is any integral curve starting at p and 
defined on an open interval containing 0 and t. Since all such integral curves 
agree at t by the argument above, 6) is well defined, and is obviously the 
unique maximal integral curve starting at p. 

Now, let D(V) = {(t,p) € Rx M: t € Dp}, and define 6: D(V) — M by 
O(t,p) = 0)(t). As usual, we also write 0;(p) = 6(t, p). Clearly 99 = Idyy 
by definition. We will verify that 6 satisfies the group law 


9 0 Os(p) = O148(P) (12.3) 


whenever the left-hand side is defined. Fix any s € D,, and write q = 
0,(p) = 0)(s). Define 7(t) = O45(p) = 0)(t + s) wherever the latter 
is defined. Then 7(0) = q, and the translation lemma shows that y is 
an integral curve of V. Thus by the uniqueness assertion above y must be 
equal to 6‘ wherever both are defined, which shows that (12.3) holds when 
both sides are defined. Since both 6) and 0 are maximal, it follows that 
t € D, if and only ift+s € D,; in particular, if the left-hand side of (12.3) 
is defined, then so is the right-hand side. 

Next we will show that D(V) is open in Rx M and that 0: D(V) — M is 
smooth. This implies D(V) is a flow domain; since it is obviously maximal 
by definition, this will complete the proof. 

Define a subset W C D(V) as the set of all (t,p) € D(V) such that 0 
is defined and smooth on a product open set J x U C Rx M, where J 
is an open interval containing 0 and t. Clearly W is open in R x M and 
the restriction of 6 to W is smooth, so it suffices to show that W = D(V). 
Suppose this is not the case. Then there exists some point (to, po) € D(V)\ 
W. For simplicity, let us assume to > 0; the argument for to < 0 is similar. 


zo oO 
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Let 7 = sup{t € R: (t,po) € W}. By the ODE theorem (applied in 
coordinates around po), @ is defined and smooth in some neighborhood of 
(0,po), so T > 0. Let qo = 0°)(r). By the ODE theorem again, there is 
some € > 0 and a neighborhood Up of qo such that @: (—e,e) x Up — M 
is defined and smooth. We will use the group law to show that @ extends 
smoothly to a neighborhood of (7, po), which contradicts our choice of T. 

Choose some t; < 7 such that t; +e > 7 and 6[9)(t,) € Up. Since ty, <7, 
(t1, Po) € W, and so there is a product neighborhood (—0d,t; + 6) x U1 of 
(t1, po) on which @ is defined and smooth. Because 6(t1, 9) € Uo, we can 
choose U; small enough that 6 maps {t;} x U; into Up. Because @ satisfies 
the group law, we have 


9:(p) = 91-41 © 92, (p) 


whenever the right-hand side is defined. By our choice of t;, 6;, (p) is defined 
for p € U,, and depends smoothly on p. Moreover, since 6:,(p) € Uo for 
all such p, it follows that 6:4, o 6;,(p) is defined whenever p € U, and 
|t — t1| < €, and depends smoothly on (t, p). This gives a smooth extension 
of 6 to the product set (—d,t1 +¢) x U1, which contradicts our choice of T. 
This completes the proof that W = D(V). 


Complete Vector Fields 


As we noted above, not every vector field generates a global flow. The ones 
that do are important enough to deserve a name. We say a vector field is 
complete if it generates a global flow, or equivalently if each of its integral 
curves is defined for all t € R. 

It is not always easy to determine by looking at a vector field whether 
it is complete or not. If you can solve the ODE explicitly to find all of 
the integral curves, and they all exist for all time, then the vector field is 
complete. On the other hand, if you can find a single integral curve that 
cannot be extended to all of R, as we did for the vector field of Example 
12.6, then it is not complete. However, it is often impossible to solve the 
ODE explicitly, so it is useful to have some general criteria for determining 
when a vector field is complete. 

In this section we will show that all vector fields on a compact manifold 
are complete. (Problem 12-1 gives a more general sufficient condition.) The 
proof will be based on the following lemma. 


Lemma 12.11 (Escape Lemma). Let M be a smooth manifold and let 
V be a vector field on M. If y is an integral curve of V whose maximal 
domain is not all of R, then the image of y cannot lie in any compact subset 
of M. 


Proof. Suppose y is defined on a maximal domain of the form (a,b), and 
assume that b < co. (The argument for the case a > —oo is similar.) We 
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will show that if 7[0, 0) lies in a compact set, then 7 can be extended past 
b, which is a contradiction. 

Let p = 7(0) and let @ denote the flow of V, so y = 6) by the uniqueness 
of integral curves. If {t;} is any sequence of times approaching b from below, 
then the sequence {¥(t;)} lies in a compact subset of M, and therefore has 
a subsequence converging to a point g € M. Choose a neighborhood U of 
q and a positive number ¢€ such that 0 is defined on (—e,¢) x U. Pick some 
i large enough that y(t;) € U and t; > b—«, and define o: [0,t; +2) ~ M 
by 


t <t<b 
O12, © 4, (p), #—-eE<t<tijt+e. 


These two definitions agree where they overlap, because 0:—1, © 61;(p) = 
6:(p) = y(t) by the group law for 6. Thus o is an integral curve extending 
y, which contradicts the maximality of y. Therefore, y[0,b) cannot lie in 
any compact set. 


Theorem 12.12. Suppose M is a compact manifold. Then every vector 
field on M is complete. 


Proof. If M is compact, the escape lemma implies that no integral curve 
can have a maximal domain that is not all of R, because the image of any 
integral curve is contained in the compact set M. 


Proof of the ODE Theorem 


In this section we prove the ODE existence, uniqueness, and smoothness 
theorem (Theorem 12.10). Actually, it will be useful to prove the existence 
theorem under the somewhat weaker hypothesis that the vector field is only 
Lipschitz continuous (see the Appendix). 

We will prove Theorem 12.10 in several parts: The uniqueness assertion 
follows from the next theorem, existence from Theorem 12.13, and smooth- 
ness from Theorem 12.16. Throughout this section, U C R” will be an open 
set, and V: U — R” will be a Lipschitz continuous map. For any to € R 
and any x € U we will study the initial-value problem 


W'() = Vi), 


: 12.4 
y'(to) = 2". ee 


Theorem 12.13 (Uniqueness of ODE Solutions). Any two solutions 
to (12.4) are equal on their common domain. 
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Proof. Suppose y,y: J — U are two solutions to the ODE on the same 
open interval J, not necessarily with the same initial conditions. The 
Schwartz inequality and the Lipschitz estimate for V imply 


“rt — y(t)? = 204) — 1) « (VAM) -— Vor) 
S 2174) — ¥E))IIVO®) -— VOM 


< 2C/F() — (0) /?. 


It follows easily that 


SCPC) — UA)P) <0, 


and so 
e 2C*H(t) — y(H)? < eP (F(t) — (to)? ¢ 2 to. 
Similarly, using the estimate 


Hil) — 1)? > 20) — HP, 


we conclude that 
ePA(t) — y(t)? < €°°"|F(to) — y(to)I?, tS to. 
Putting these two estimates together, we obtain the following for all t € J: 
V(t) — y(#)| < eC *! (to) — (to) |. (12.5) 


Thus (to) = ¥(to) implies y = ¥ on all of J. 


Theorem 12.14 (Existence of ODE Solutions). For each tp € R and 
zo € U, there exist an open interval Jo containing to, an open set Up C 
U containing xo, and for each x € Uo a differentiable curve y: Jo — U 
satisfying the initial-value problem (12.4). 


Proof. If y is any continuous curve in U, the fundamental theorem of cal- 
culus implies that + is a solution to the initial-value problem (12.4) if and 
only if it satisfies the integral equation 


t 
y(t) = 2 +f V(y(s)) ds, (12.6) 

to 
where the integral of the vector-valued function V(7(s)) is obtained by 


integrating each component separately. For any such y we define a new 
curve Ay by 


Ay(t) =a2+ ih V(y(s)) ds. (12.7) 
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Then we are led to seek a fixed point for A in a suitable metric space of 
curves. 

Let C be a Lipschitz constant for V. Given to € R and xo € U, choose 
r > 0 such that B,(a9) C U, and let M be the supremum of |V(a)| on 
B,(xo). Set Jo = (to —€, to +€) and Up = Bs(xo), where € and 6 are chosen 
small enough that 


6< €< mi Re 
min art peat ‘ 
ies ? IM’C 


For any x € Up, let Mz denote the set of all continuous maps 7: Jo > 
B,(xo) satisfying 7(0) = x. We define a metric on M, by 


No] 3 


dy, 7) = sup |7(t) — ¥4)I- 
te Jo 


Any sequence of maps in M, that is Cauchy in this metric is uniformly 
convergent, and therefore has a continuous limit . Clearly, the conditions 
that y take its values in B,(ao9) and y(0) = x are preserved in the limit. 
Therefore, 1, is a complete metric space. 

We wish to define a map A: M, — Mz, by formula (12.7). The first 
thing we need to verify is that A really does map M, into itself. It is clear 
from the definition that Ay(0) = x and Ay is continuous (in fact, it is 
differentiable by the fundamental theorem of calculus). Thus we need only 
check that Ay takes its values in B,(a9). If y € Mz, then for any t € Jo, 


|Ay(t) — 0] = e+ f vos Ora 


< | —aol+ f Ves ))| ds 
<6+Me<r 
by our choice of 6 and «. 


Next we check that A is a contraction. If y,y € Mz, then using the 
Lipschitz condition on V, we obtain 
d( Ay, AY) = sup 


su Le (sds f Ves) as 


< sup ‘Wer (s)) — V(7(s))| ds 


te Jo / to 


t 
< sup | Cly(s) — 7(s)| ds 


te Jo / to 


< Ced(y,7). 
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Because we have chosen ¢€ so that Ce < 1, this shows that A is a contraction. 
By the contraction lemma, A has a fixed point y € Mz, which is a solution 
to (12.4). 


As a preliminary step in proving smoothness of the solution, we need the 
following continuity result. 


Lemma 12.15 (Continuity of ODE Solutions). Suppose Jo is an 
open interval containing to, Up C U is an open set, and @: Jo x Up — U is 
any map such that for each x € Up, y(t) = O(t, x) solves (12.4). Then 6 is 
continuous. 


Proof. It suffices to show that 6 is continuous in a neighborhood of each 
point, so by shrinking Jo and Up slightly we might as well assume that Jo 
is precompact in R and Up is precompact in U. 

First we note that @ is Lipschitz continuous in x, with a constant that is 
independent of t, because (12.5) implies 


|A(t, Z) — A(t, x)| < eC? |Z—-a| for all z,% € Up, (12.8) 


where T = supz, |t — tol. 

Now let (t,x), (t,%) € Jo x Up be arbitrary. Using the fact that every 
solution to the initial-value problem satisfies the integral equation (12.6), 
we find 


|O(t, Z) — O(t,x)| < |Z- al + ; V(6(s,2)) ds — f V(0(s, x)) ds 


to to 
t 


< |g - 2] 4 |\V(A(s, %)) — V(O(s, x))| ds 


+ | V(0(s,Z))| ds 


t t 
< |@-2|+C 0(s,%) — 6(5,2)]ds +f M ds 
t 


to 
< |¥ — 2] + CTe C7 |Z — x| + Mt — tl, 


where M is the supremum of |V| on Uo. It follows that @ is continuous. 


Theorem 12.16 (Smoothness of ODE Solutions). Let 0 be as in the 
preceding theorem. If V is smooth, then so is 0. 


Proof. We will prove the following claim by induction on k: 


If V is of class C**1, then 6 is of class C*. (12.9) 
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From this it follows that if V is smooth, then @ is of class C* for every k, 
and thus is smooth. 

The hardest part of the proof is the k = 1 step. Expressed in terms of 0, 
the initial-value problem (12.4) reads 


OP _ yi 
5o (0) = V"(O(t, 2), (12.10) 
6° (to, x) =e 


Let us pretend for a moment that everything in sight is smooth, and dif- 
ferentiate both of these equations with respect to x’. Since mixed partial 
derivatives of smooth functions commute, we obtain 


0 06" OV? 06k 
Di Dal | ,t) = ar (OC »t)) a F(t, 2), 


0" 1 ifi=j 
—(to,r) = 6) = ; 
api (to ®) = 6; {) if if j. 


The idea of the proof for k = 1 is to show that the partial derivatives 
00° /Ox) exist and solve this system of equations, and then to use the con- 
tinuity lemma to conclude that these partial derivatives are continuous. 

To that end, we let G: U — M(n,R) denote the matrix-valued function 
G(x) = DV(a). The assumption that V is C? implies that G is C1, so 
(shrinking Uo if necessary) the map Up x M(n,R) — M(n,R) given by 
(x,y) — G(x)y is Lipschitz. 

Consider the following initial-value problem for the n + n? unknown 
functions (6°, 7%): 


a 
apt #) = V(0(t,z)), 
< wth x) = G(O(t, «)) b(t, x); (12.11) 
A(to, x) =a, 
w(to, x) = In. 


This system is called the variational equation for the system (12.4). By 
the existence and continuity theorems, for any x € Up there exist an 
interval J; C Jo containing to, a neighborhood of (29, In) in Uo x M(n,R) 
(which we may assume to be a product set U; x W,1), and a continuous 
map (0,%): J; x Uy x W, — Up x M(n,R) satisfying (12.11). If we can 
show that 06’/dx/ exists for each i and j and equals ~% on J; x U1, then 
00' /Ox? is continuous there. Moreover, (12.4) implies 06*/0t is continuous, 
so it follows that @ is C', at least on the set J, x U4. 

We will show that 06‘/0x exists by working directly with the definition 
of the derivative. For any sufficiently small h € Rx {0}, let An: Ji x U1 > 


322 12. Integral Curves and Flows 


M(n, R) be the difference quotient 


6’ (t, 2 + hej) — O°(t, x) 
i, ; 


(An); (t,2) = 


Then 06°/Ox/ (t,x) = limpo(An)5(t, 2) if the limit exists. 
Because a C? map is differentiable, for sufficiently small 6 > 0 there is a 
map R: Bs(0) — R” such that 


V(£) — V(x) = DV(a)(@ — x) + R(&— 2) (12.12) 
and 
Rv) 
lim TT = 0. (12.13) 


Let us compute the t-derivative of Ap, using (12.12): 


& (An)i(t2) = - (Sores he;) — 59'(2)) 
= (Vi((t,@ + he;)) — Vi(O(t,2))) 
= & (6(t,x))(0* (t,x + he;) — O*(t,2)) 


+ R'(O(t, 2 + he;) — 0(t, »)) 


: R'(0(t,2 + zs) — 6(t, x)) 


= Gi(a(t, x))(An)§ (t, «)) 


By 12.13, given any ¢ > 0 there exists 6 > 0 such that |v| < 6 implies 
|R(v)|/|v]| < e. The Lipschitz estimate (12.8) for @ says that 


|O(t, a + he;) — A(t, x))| < K|he;| = K|h| 
for some constant K, and therefore |h| < 6/K implies that 


R(O(t,z + he;) — A(t, z)) 


i <e. 


Summarizing, we have shown that for h sufficiently small, A, is an “e- 
approximate solution” to the second equation of (12.11), in the sense that 


() 
adults zr) = G(O(t, r))An(t, zr) + E(t, x), 

where |E(t, )| < e. It also satisfies the initial condition Ap(tp, x) = In for 
each x. 
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Let (0,4) be the exact solution to (12.11). We will show that Ap con- 
verges to = as h — 0. To do so, we note that 


2 


<lo(t2) — An(t,2) 
= 2(u(t,2) — An(t,2)) 
(G(0(t, z)) v(t, x) — G(0(t, z))An(t, x) — E(t, 2)) 


< 2lW(t, x) — An(t, x)| (|G(9G, 2)| |Y(t, x) — An(t,x)| + €) 


< 2B\ y(t, x) ~~ An(t, 2)? at 2e|p(t, x) ~~ An(t, x)| 


IA 


(2B+ 1)|[(t, v) — An(t, 2)|? +6, 


where B = sup|G|, and the last line follows from the inequality 2ab < 
a?+b?, which is proved just by expanding (a—b)? > 0. Thus if J; c [—T,T], 


£ (eCPM MCE, 2) ba An(t,2)|*) a e2e7 (2B+)t < ete ABT 


Since w(to,z) = Ap(to, x), it follows by elementary calculus that 
e OB (t, 2) — An(t, 2)|? < e7eP@P TNT It — to], 


In particular, since ¢ can be made as small as desired by choosing h suffi- 
ciently small, this shows that lim;,_.9 A;,(t, 7) = w(t, z). Therefore, @ is C+ 
on the set J, x Uj. 

To show that @ is actually C! on its entire domain Jo x Up, we proceed 
just as in the proof of Theorem 12.9. For each x9 € Up, the argument 
above shows that @ is C' in some neighborhood J; x Uj of (to, 29). Let 
Tt be the supremum of the set of t > t) such that @ is Ct on a product 
neighborhood of [to, t} x {zo}. If (7,20) € Uo, we will show that 6 is C! on 
a product neighborhood of [to,7] x {ao}, which contradicts our choice of rT. 
(The argument for t < to is similar.) 

By the argument above, there exist ¢ > 0, a neighborhood U2 of 6(r, xo), 
and a C! map 3: (—e,+¢) x Uz — Up such that 


Choose t; € (7 — €,T) such that 0(t1,20) € U2. Then O(t,2) = B(t- 
ty, 0(t1, 2)) where both are defined, because they both solve the same ODE 
and are equal to 6(t1,2) when t¢ = ¢;. Since the right-hand side is smooth 
for x € Uz and ty —e <t <t1 +¢, this shows that 0 is C1 on [to, ti +€) x 
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(U2 Uj), which contradicts our choice of t. Therefore, 9 is C1 on its whole 
domain, thus proving (12.9) for k = 1. Moreover, we have also shown that 
(0°, 00° /Ox7) solves the variational equation (12.11). 

Now assume by induction that (12.9) is true for 1 < k < ko, and suppose 
V is of class C**+!. By the argument above, (6°, 06'/Ox/) solves the varia- 
tional equation. Since the first partial derivatives of V are of class C°, the 
inductive hypothesis applied to (12.11) shows that (6°, 00'/Oz7) is of class 
Cro-1, Since @ is of class C*°—! by the inductive hypothesis and therefore 
00'/dt is C’°—! by (12.10), it follows that all of the partial derivatives of 
6 are C*—!, so @ itself is C’°, thus completing the induction. 
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Problems 


12-1. 


12-2. 


12-3. 


12-4. 


Show that every smooth vector field with compact support is com- 
plete. 


Let M be acompact Riemannian n-manifold, and f € C(M). Sup- 
pose f has only finitely many critical points {p1,...,px} with cor- 
responding critical values {c,,...,c,}. (Assume without loss of gen- 
erality that c, < --- < cj.) For any a,b € R, define M, = f~'(a) 
and Mj,4) = f~'({a, |). If a is a regular value, note that M, is an 
embedded hypersurface in M. 


(a) Let X be the vector field X = grad f/|gradf|? on M 
{pi,.--,Pr}, and let 6 denote the flow of X. Show that 
f(0:(p)) = f(p) +t whenever 0;(p) is defined. 


(b) Let [a,b] C R be an interval containing no critical values of f. 
Show that 
0: [(0,b— al x Ma ors Mab] 


is a diffeomorphism, whose inverse is p t> ( f(p) — a, O(a — 


f(p),p)). 


[Remark: This result shows that M can be decomposed as a union 
of simpler “building blocks” —the product manifolds M{c,+¢,¢,,,—<] © 
I x M¢, 42, and the neighborhoods f~1!((e; — €,¢; + €)) of the critical 
points. This is the starting point of Morse theory, which is one of 
the deepest applications of differential geometry to topology. It is 
enlightening to think about what this means when M is a torus of 
revolution in R? obtained by revolving a circle around the z-axis, and 


f(@,y, 2) = #] 


Let M be a connected smooth manifold. Show that the group of 
diffeomorphisms of M acts transitively on M. More precisely, for any 
two points p,q € M, show that there is a diffeomorphism F’': M — M 
such that Fp) = q. [Hint: First prove the following lemma: If p,q € 
B” (the open unit ball in R"), there is a compactly supported vector 
field on B” whose flow @ satisfies 0; (p) = q.] 


Let M be a smooth manifold. A smooth curve 7: R — M is said to 
be periodic if there exists T > 0 such that y(t) = y(t’) if and only if 
t—t' =kT for some k € Z. Suppose X € J(M) and ¥ is a maximal 
integral curve of X. 


(a) Show that exactly one of the following holds: 


e ¥ is constant. 


e ¥ is injective. 
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12-5. 


12-6. 


12-7. 
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e ¥ is periodic. 


(b) Show that the image of y is an immersed submanifold of M. 


Show that there is only one smooth structure on R up to diffeomor- 
phism. More precisely, if M is any smooth manifold that is home- 
omorphic to R, show that M is diffeomorphic to R (with its stan- 
dard smooth structure). [Hint: First show that M admits a nowhere- 
vanishing smooth vector field. See Problem 10-1.] 


Let @ be a flow on an oriented manifold. Show that for each t € R, & 
is orientation-preserving wherever it is defined. 


All of the systems of differential equations considered in this chapter 
have been of the form 


y(t) =V(7), 


in which the functions V* do not depend explicitly on the independent 
variable ¢t. (Such a system is said to be autonomous.) If instead V is 
a function of (t,x) in some subset of R x R”, the resulting system is 
called nonautonomous; it can be thought of as a “time-dependent vec- 
tor field” on a subset of R”. This problem shows that local existence, 
uniqueness, and smoothness for a nonautonomous system follow from 
the corresponding results for autonomous ones. Suppose U C R” is an 
open set, J C R is an open interval, and V: J x U — R” is a smooth 
map. For any (to, 20) € J x U and any sufficiently small ¢ > 0, show 
that there exists a neighborhood Ug of xo in U and a unique smooth 
map w': (t9 — €,t9 +€) x Up — U such that for each x € Up, the 
curve y(t) = w(t, 2) is the unique solution on (to — €, to + €) to the 
nonautonomous initial-value problem 


y(t) = V*t,7@), 
7°(to) =z". 


(Hint: Replace this system of ODEs by an autonomous system in 
R"*1] 


13 


Lie Derivatives 


This chapter is devoted primarily to the study of a particularly important 
construction involving vector fields, the Lie derivative. This is a method 
of computing the “directional derivative” of one vector field with respect 
to another. We will see in this and later chapters that this construction 
has applications to flows, symplectic manifolds, Lie groups, and partial 
differential equations, among other subjects. 


The Lie Derivative 


We already know how to compute “directional derivatives” of functions on 
a manifold: Indeed, a tangent vector V, is by definition an operator that 
acts on a smooth function f to give a number V,f, which we interpret as 
the directional derivative of f in the direction V,. 

What about the directional derivative of a vector field? In Euclidean 
space, we have a perfectly good way of making sense of this: If V is a 
tangent vector at p € R”, and W is a smooth vector field on R”, we can 
define the “directional derivative” of W in the direction of V as the vector 


DyW = a Wortv — Wp 
; ; 


a (13.1) 


Writv = lim 
t=0 t—0 
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A standard calculation using the chain rule shows that DyW can be cal- 
culated by applying V to each component of W separately: 


3) 


DyW(p) =VW"(p) a5] - 
Pp 


Unfortunately, as we will see, this does not define a coordinate- 
independent operation. If we search for a way to make invariant sense of 
(13.1) on a manifold, we will see very quickly what the problem is. To begin 
with, we can replace p+tV by any curve y(t) that starts at p and whose 
initial tangent vector is V. But even with this substitution, the difference 
quotient still makes no sense because W,,(,) and W.(9) are elements of differ- 
ent vector spaces (Ty4)M and T,(9)M). We got away with it in Euclidean 
space because there is a canonical identification of each tangent space with 
R” itself; but on a manifold there is no such identification. Thus there is 
no coordinate-invariant way to make sense of the directional derivative of 
W in the direction of the vector V. 

Now suppose that V itself is a vector field instead of a single vector. In 
this case, we can use the flow of V to push values of W back to p and then 
differentiate. Thus, for any smooth vector fields V and W on a manifold 
M, let 6 be the flow of V, and define a vector (LyW), at each p € M, 
called the Lie derivative of W with respect to V, by 


(0_+)»Wo.@) — Wp 
t ? 


d ; 
(SvW)p = =) (8-t)«Wo,(p) = lim (13.2) 


dt |, 


provided the derivative exists. For small t # 0, the difference quotient 
makes sense at least, because 9; is defined in a neighborhood of p and both 
(0_1)xWo,(p) and W, are elements of T,M. 


Exercise 13.1. If V € R” and W is a smooth vector field on an open 
subset of R”, show that the directional derivative Dy W (p) defined by (13.1) 
is equal to (0; W)p», where V is the vector field V = V‘0/Ox' with constant 
coefficients in standard coordinates. 


This definition raises a number of questions: Does (6_1)«Wo,(p) always 
depend differentiably on t, so that the derivative in (13.2) always exists? 
If so, does the assignment p +> (LyW), define a smooth vector field on 
M? And most importantly, is there a reasonable way to compute (Ly W)>p, 
given that the only way to find the integral curves of V is to solve a system 
of ODEs, and most such systems cannot be solved explicitly? Fortunately, 
there are good answers to all these questions, but we will need to develop 
a few more tools in order to describe them. 
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We begin by defining another way to combine two vector fields to obtain a 
new vector field, seemingly unrelated to the Lie derivative. 

Let V and W be smooth vector fields on a smooth manifold M. Given 
a smooth function f: M@ — R, we can apply V to f and obtain another 
smooth function Vf, to which we can then apply the vector W, to obtain 
areal number W,(V f); of course we can also do the same thing the other 
way around. Applying both of these operators to f and subtracting, we 
obtain an operator [V,W],: C°(M) — R, called the Lie bracket of V and 
W, defined by 


[V, Wipf = V,(W f) = W,(Vf). 


Lemma 13.1. For any two vector fields V,W € J(M), the Lie bracket 
[V,W], satisfies the following properties. 


(a) [V,W]p € TpM. 


(b) The assignment p> [V,W], defines a smooth vector field [V,W] on 
M, which satisfies 


[V,W]f =VWf—Wwvy. (13.3) 


(c) If (x') are any local coordinates on M, then [V,W] has the coordinate 
e€XPTESsSion 


(13.4) 


vin= (vi wi) 


Ox3”’ 
or more concisely, 
6) 


= ae = = 
[V, W] = (VW? — WV?) =. 


Proof. First we prove that [V, W], is a tangent vector, i.e., a linear deriva- 
tion on the space of germs of functions at p. Clearly [V,W],f depends only 
on the values of f in a neighborhood of p, so it is well-defined on germs. 
As a map from C™(p) to R, it is obviously linear over R, so only the prod- 
uct rule needs to be checked. If f and g are smooth functions defined in a 
neighborhood of p, then 
[V, W],(f9) = Vo(W(f9)) — W,(V(f9)) 

=V,(fWg + gWf) —-W,(fV9+ oVf) 

= (Vpf)(Wpg) + f(p)Vp(W ag) + (Vrg) (Worf) + 9(p) Vp(W Ff) 

= (Wpf)(Vpg) = f(p)Wp(Vg) = (Wyg)(Vpf) a g(p)W,(V f) 
= f(p)(Vp(Wg) — W,(Vg)) + 9p) (Vo(WH) — W2(V f)) 
= f(P)IV,W]pg + g)IV, WI) pf. 
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This shows that [V,W], satisfies the product rule and therefore defines a 
tangent vector at p. 

Formula (13.3) is immediate from the definition of the Lie bracket, and it 
follows from this that [V, W] is a smooth vector field, because (13.3) defines 
a smooth function whenever f is smooth in an open subset of M. 

To prove (c), we just write V = V*0/dz' and W = W/0/0z! in coordi- 
nates, and compute: 


woe (wl) wi (oD 


Ox? OxI Ox? 
= yi? OF yigys OF yi OY" OF _ pays OF 
Ox* Ox) Ox' Ox) OxI Ox? OxI Ox? 
_ yi OW! of ; OV" of 
7 Ox? OxI Oxi Ox'’ 


where in the last step we have used the fact that mixed partial derivatives of 
a smooth function commute. Reversing the roles of the summation indices 
i and j in the second term, we obtain (13.4). 


Exercise 13.2. For each of the following pairs of vector fields V, W defined 
on R°, compute the Lie bracket [V, W]. 


_. O 20. _ O 

(a) V Cm 2xy By’ Ww a 
3) 0 
b) V= ; We , 
(De Ma a ae 
3) fe) te) te) 
(c) V oa Yas) WwW 5, tU¥aT 


Lemma 13.2 (Properties of the Lie Bracket). The Lie bracket sat- 
isfies the following identities: 


(a) BILINEARITY: For aj,a2 € R, [aiVi + a2V2,W] = ai[Vi,W] + 
a2[Va, WwW) and [V, ayWy + a2W)] = a1|V, Wi] + ag [V, W3]. 
(b) ANTISYMMETRY: [V,W] = —[W, V]. 
(c) JACOBI IDENTITY: [V, [W, X]] + [W, .X, V]] + [X, [V, W]] = 0. 
Proof. Bilinearity and antisymmetry are obvious consequences of the defi- 
nition. The proof of the Jacobi identity is just a computation: 
[V.[W, X]]f + [W,[X, VIF + (4, [V, WI]f 
=V(W, Xf -— |W, X]Vf+W[X,V]f 
—|[X,VIWS+ X[V,WIf -[V,W] Xf 
=VWXf-VXWf-WxXVf+xXWVf+tWXVfE-WVxXf 
—-XVWf+VXWf+XVWE-XWVf-VWXf+WVXf. 


In this last expression, all the terms cancel in pairs. 
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Proposition 13.3 (Naturality of the Lie Bracket). Let F: M— N 
be a smooth map, and let Vi,V2 € TM) and W,,W2 € T(N) be vector 
fields such that V; is F'-related to W;, i = 1,2. Then [Vi, V2] is F'-related to 
[(W1, W2]. If F is a diffeomorphism, then F.[Vi, V2] = [FiVi, FV]. 


Proof. Using Lemma 3.17 and fact that V; and W; are F-related, 
ViV2(fo F) = Vi((W2f) oF) 
= (WWof) oF. 
Similarly, 
VaVi(f oF) = (W2Wif)o F. 
Therefore, 
[Vi Val(fo F) =ViVa(fo F) —Vavi(f oF) 


=(WiW2f)oF —-(WiWif)oF 
= ((Wi, Wal f) oF. 
The result then follows from the lemma. The statement when F is a diffeo- 


morphism is an obvious consquence of the general case, because W; = F;, V; 
in that case. 


Proposition 13.4. Let N be an immersed submanifold of M, and suppose 
V,W €T(M). IfV and W are tangent to N, then so is [V,W]. 


Proof. This is a local question, so we may replace N by an open subset of 
N that is embedded. Then Proposition 5.8 shows that a vector X € T,M 
is in T,N if and only if Xf = 0 whenever f € C'°(M) vanishes on N. 
Suppose f is such a function. Then the fact that V and W are tangent to 
N implies that Vf|~ = Wf|n = 0, and so 


[V,W],f =V,(Wf) —W,(V f) = 0. 


This shows that [V,W], € T,.N, which was to be proved. 


We will see shortly that the Lie bracket [V, W] is equal to the Lie deriv- 
ative Ly W, even though the two quantities are defined in ways that seem 
totally unrelated. Before doing so, we need to prove one more result, which 
is of great importance in its own right. If V is a smooth vector field on M, 
a point p € M is said to be a singular point for V if V, = 0, and a regular 
point otherwise. 


Theorem 13.5 (Canonical Form for a Regular Vector Field). Let 
V be a smooth vector field on a smooth manifold M, and let p © M be a 
regular point for V. There exist coordinates (x') on some neighborhood of 
p in which V has the coordinate expression 0/Ozx'. 
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Proof. By the way we have defined coordinate vector fields on a manifold, a 
coordinate chart (U,y) will satisfy the conclusion of the theorem provided 
that (y~!),.(0/dx') = V, which will be true if and only if y~! takes lines 
parallel to the xz! axis to the integral curves of V. The flow of V is ideally 
suited to this purpose. 

Begin by choosing any coordinates (y*) on a neighborhood U of p, with 
p corresponding to 0. By composing with a linear transformation, we may 
assume that V, = 0/dy'|p. Let 6: D(V) — M be the flow of V. There 
exists € > 0 and a neighborhood Up C U of p such that the product open 
set (—e,€) x Up is contained in D(V) and is mapped by @ into U. 

Let S C R"! be the set 


BST cs 320, Bae”) Eo}: 
and define a smooth map w: (—e,¢) x S — U by 
w(t, x7,...,2") = A(t, (0, 27,...,2”)). 
Geometrically, for each fixed (x?,...,2"), ~ maps the interval (—e,¢) x 


{(x?,...,2”)} to the integral curve through (0,2?,...,2”). 
First we will show that 7 pushes 0/0t forward to V. We have 


6) O 
Wx vi f= ap. 


(13.5) 


= Vis (to,00) Ss 


where we have used the fact that t +> @(t,(0,20)) is an integral curve 
of V. On the other hand, when restricted to {0} x S, W(0,2?,...,2") = 
GO One cnt) SS (00 9 og By SO 


0 


* Oat 


ad 


v = 55 
(00) OY 


(0,0) 


Since wx: T(o,9)((-€, €) x S) — T,M takes a basis to a basis, it is an isomor- 
phism. Therefore, by the inverse function theorem, there are neighborhoods 
W of (0,0) and Y of p such that ~: W — Y is a diffeomorphism. 

Let p= w!: Y — W. Equation (13.5) says precisely that V is equal 
to the coordinate vector field 0/0t in these coordinates. Renaming t to z?, 
this is what we wanted to prove. 


This theorem implies that the integral curves of V near a regular point 
behave, up to diffeomorphism, just like the x!-lines in R”, so that all of 
the interesting local behavior is concentrated near the singular points. Of 
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course, the flow near singular points can exhibit a wide variety of behav- 
iors, such as closed orbits surrounding the singular point, orbits converging 
exponentially or spiraling into the singular point as t — oo or —oo, and 
many more complicated phenomena, as one can see in any good differential 
equations text that treats systems of ODEs in the plane. This is the start- 
ing point for the subject of smooth dynamical systems, which is the study 
of the global and long-time behavior of flows of vector fields. 

We are now in a position to prove the promised formula for computing 
Lie derivatives. 


Theorem 13.6. For any smooth vector fields V and W on a smooth man- 
ifold M, Ly W = [V,W]. 


Proof. We will show that (Ly W), = [V,W], for each p € M. We consider 
two cases. 

CASE I: p is a regular point for V. In this case, we can choose coordinates 
(x*) near p such that V = 0/0z* in coordinates. In this case, the flow of V 
is easy to compute explicitly: 


O(a) = (a +t, 07,32. ,2). 


Therefore, for each fixed t, the matrix of (6;). in these coordinates (the 
Jacobian matrix of 6;) is the identity at every point. Consequently, 


: 0 

(0_+)«Wo,(p) = (O_4)« W?3 (ag? +4,27,. 6 io”) a 
Ox! |g, (p) 
= Wi? (2! 4+ t,x? x”) sae 
ND Pata: all 
Pp 
Using the definition of the Lie derivative, 

d : 0 

L a I(gt +t, 27,...,2") — 
( vW)p dt Leas (x + 7 v ’ »v ) Oxi : 
ows i ww JO 
Pp 


On the other hand, using the formula (13.4) for the Lie bracket in coordi- 
nates, [V, W], is easily seen to be equal to the same expression. 

CASE II: p is a singular point for V. In this case, we cannot write down 
the flow explicitly. However, it does have the property that 6;(p) = p for 
all t € R (Lemma 12.8), and therefore (9_;). maps T,,M to itself. Since the 
matrix entries of (0_;).: T,M — T,M in any coordinate system are partial 
derivatives of 9 and therefore are smooth functions of t, it follows that the 
components of the T,, M-valued function t +> (6_¢)«Wo,(p) = (0-+)«Wp with 
respect to the coordinate basis are also smooth functions of t. If tr» X(t) 
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is any smooth curve in 7;,,M, then for any smooth function f defined on a 


neighborhood of p, 
d eee Of 
(Sx) 1= (4x0) Ho 


= (Sx ho) 
(GOR, 


Applying this to (@_;).W, and using the definition of the Lie derivative, 


(vWf = (5 


(0-1).Wp) f 


t=0 


==] (O-)-WS) 


On the other hand, since V, = 0, we have 


[V, Wipf = V,(W f) = W,(V f) = —-W,(V f), 


which is equal to (Ly W),f. 


Corollary 13.7. If V and W are as in the statement of the theorem, then 
(a) The assignment p> (LyW), is a smooth vector field on M. 
(b) LyW =—-LyV. 
(c) IfF: MN is a diffeomorphism, then F.(LyW) = &p.vF.W. 


Exercise 13.3. Prove this corollary. 
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Commuting Vector Fields 


Two smooth vector fields are said to commute if [V, W] = 0, or equivalently 
if VWf = WVf for every smooth function f. One simple example of 
a pair of commuting vector fields is 0/Ox* and 0/Ox/ in any coordinate 
system: because their component functions are constants, their Lie bracket 
is identically zero. 

We will see that commuting vector fields are closely related to another 
important concept. Suppose W is a smooth vector field on M, and @ is a 
flow on M. We say W is invariant under 6 if (0;),W = W on the image of 
6:. More explicitly, this means that 


(0:)xW,p = Wo,(p) for all (¢,p) in the domain of 0. 


We will show that commuting vector fields are invariant under each 
other’s flows. The key is the following somewhat more general result about 
F-related vector fields. 


Lemma 13.8. Suppose F: M — N is a smooth map, V € T(M), and 
W €T(N), and let 0 be the flow of V and w the flow of W. Then V and 
W are F-related if and only if for each t € R, Y,0F = Fo on the domain 
of Ot . 


M 


Or Vt 


M 


7 N 


Proof. The commutativity of the diagram means that the following holds 
for all (t,p) in the domain of 0: 


Wt 0 F(p) = Fo 6,(p). 


If we let D, C R denote the domain of 0”), this is equivalent to 
pF) (t) = Fod®)(t), tE Dy. (13.6) 


Suppose first that V and W are F-related. If we define y: D, — N by 
y= Fo), then 


¥ () = (F 00) 
= F.(0"(t)) 
= F. Vo) (2) 
= W Foo) (t) 


= WW y(t)> 
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so 7 is an integral curve of W starting at Fo 6‘)(0) = F(p). By uniqueness 
of integral curves, therefore, the maximal integral curve ~"®)) must be 
defined at least on the interval D,, and y(t) = "))(¢) on that interval. 
This proves (13.6). 

Conversely, if (13.6) holds, then for each p € M we have 


FV» = F.(6""(0)) 
= (F00)'(0) 


-_ pF ))"(0) 
= Wr), 


which shows that V and W are F-related. 


Proposition 13.9. Let V and W be smooth vector fields on M, with flows 
8 and w, respectively. The following are equivalent: 


(a) V and W commute. 
(b) LyW =LyV =0. 
(c) W is invariant under the flow of V. 
(d) V is invariant under the flow of W. 


(e) 6,0, = u,00, wherever either side is defined. 


Proof. Clearly (a) and (b) are equivalent because LyW = [V,W] = 
—LyV. Part (c) means that (6_+)«Wo,(p) = Wp whenever (—t,p) is in 
the domain of 0, which obviously implies (b) directly from the definition of 
LyW. The same argument shows that (d) implies (b). 

To prove that (b) implies (c), let p € M be arbitrary, let D, C R denote 
the domain of the integral curve 6), and consider the map X: Dy, — TpM 
given by the time-dependent vector 


X(t) = (0-+)« (Wop) € TpM. (13.7) 


This can be considered as a smooth curve in the vector space T,M. We 
will show that X(t) is independent of t. Since X(0) = W,, this implies that 
X(t) = W, for all t ¢ D,, which says that W is invariant under 6. 

The assumption that 2)W =0 means precisely that the t-derivative of 
(13.7) is zero when t = 0; we need to show that this derivative is zero for 
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all values of t. Making the change of variables t = to + s, we obtain 


d 
X'(to) =F (0_1)«Wo, 
dt|,_., (p) 
d 
= Ag ber (0—t-s)»Wo,445(p) 
d (13.8) 
els (0—t0 )»(O—s)» Wo. (61, (p)) 
d 
— (0_t5)« ds = (0_s)«Wo, (645 (p)) 


= (0-1)«(LvW)o,,) = 9- 


(The equality on the next-to-last line follows because (0—1,)«: To,,(p)M — 
T,M is a linear map that is independent of s.) The same proof also shows 
that (b) implies (d). 

To prove that (c) and (e) are equivalent, we let M; denote the domain 
of 0; and use Lemma 13.8 applied to the map F = 6: M, — 6:(M:). 
According to that lemma, 6; 0 Ws = ws 0 @& on the set where 0; o a, is 
defined if and only if (6;),.W = W on 6;(M;), which is to say if and only 
if W is invariant under 6. By reversing the roles of V and W, we see that 
this is also true on the set where w, o 6; is defined. 


As we mentioned above, one example of a family of commuting vector 
fields is given by the coordinate vector fields 0/02’, i =1,...,n. The next 
theorem shows that up to diffeomorphism, any collection of independent 
commuting vector fields is of this form locally. 


Theorem 13.10 (Canonical Form for Commuting Vector Fields). 
Let M be a smooth n-manifold, and let Vi,...,V% be smooth vector fields 
on an open subset of M whose values are linearly independent at each 
point. Then the following are equivalent: 


(a) There exist coordinates (x) in a neighborhood of each point such that 
Ves O00 4 = 1yceg KR 


(b) [Vi,V;] =0 for all i and j. 


Proof. The fact that (a) implies (b) is obvious because the coordinate vec- 
tor fields commute and the Lie bracket is coordinate-independent. 

To prove the converse, suppose V},..., Vz are vector fields satisfying (b). 
The basic outline of the proof is entirely analogous to that of the canonical 
form theorem for one nonvanishing vector field (Theorem 13.5), except that 
we have to do a bit of work to make use of the hypothesis that the vector 
fields commute. 

Choose coordinates (y’) on a neighborhood U of p such that p corre- 
sponds to 0 and V; = 0/dy' at 0 for i = 1,...,k. Let 6; be the flow of 
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V, for 1 = 1,...,k. There exists ¢« > 0 and a neighborhood W of p such 
that the composition (0%)+, © (Ax—1)%,_1 0°++ © (81)4, is defined on W and 
maps W into U whenever |ti|, ..., |t,| are all less than e¢. (Just choose 
€; > 0 and U; Cc U such that 0; maps (—¢1,¢1) x U, into U, and then 
inductively choose ¢; and U; such that 0; maps (—é;,¢;) x U; into Uj_1. 
Taking « = min{e;} and W = U;, does the trick.) 

As in the proof of the canonical form theorem for one nonvanishing vector 
field, let 


Safa". o.e) 2 Os 0a 2 ye wh. 
Define w: (—e,e)* x W > U by 


We will show first that 
0 
P| ee el a 
u Ox? : 


Because all the flows 6; commute with each other, we have 


(ga) f= (we....2")) 


where the hat means that (;),: is omitted. Now, for any p € M, t 
(9;):(p) is an integral curve of V;, so this last expression is equal to (Vi) ya) f, 
which proves the claim. 

Next we will show that w, is invertible at p. The computation above 
shows that fori =1,...,k, 


6) O 
se. SA oe = V, => 27 . 
v Ox" | P Oy? 3 

On the other hand, since ¥(0,...,0,2*+!,...,2”) = 
(0,...,0,a*+1,...,””), it follows immediately that 

O O 

We eal aa 

x* |o ye |e 

fori = k+1,...,n as well. Thus wv, takes a basis to a basis, and is therefore 


a diffeomorphism in a neighborhood of 0. It follows that y = w~! is the 
desired coordinate map. 
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Lie Derivatives of Tensor Fields 


The Lie derivative operation can be extended to tensors of arbitrary rank. 
As usual, we focus on covariant tensors; the analogous results for contravari- 
ant or mixed tensors require only minor modifications. 

Let X be asmooth vector field on a smooth manifold M, and let @ be its 
flow. Near any p € M, if t is sufficiently close to zero, 6; is a diffeomorphism 
from a neighborhood of p to a neighborhood of @;(p). Thus 6% pulls back 
smooth tensor fields near 6;(p) to ones near p. 

Given a covariant tensor field 7 on M, we define the Lie derivative of 7 
with respect to X, denoted by £x7, as 


0 r _ 970. (p) — Tp 
(LxT)p = Bs (8:7)p = lim ——~——. 


Because the expression being differentiated lies in T*(T,M) for all t, 
(£x7)p, makes sense as an element of T*(T,M). We will show below that 
£xT is actually a smooth tensor field. First we prove the following impor- 
tant properties of Lie derivatives of tensors. 


Proposition 13.11. Let M be a smooth manifold. Suppose X, Y are 
smooth vector fields on M, 0, T are smooth covariant tensor fields, w, 
are differential forms, and f is a smooth function (thought of as a 0-tensor 


field). 

(a) xf =Xf. 

(b) £Lx(fo) = (Lx flo + fhxeo. 

(c) &x(o@r) = (Lxo) ®t +0 @Lxr. 
(i) LGA HE xo KG Shean. 
yh) Sani ee ia. 


(f) For any smooth vector fields Y1,...,Yx, 


£x(o(%,..-,Ye)) = (Lx0)(Y1,..-, Ye) 
+a(LxVi,...,Y¥r) te: +0(%,...,4xYp). (13.9) 


Proof. The first assertion is just a reinterpretation of the definition in the 
case of a 0-tensor. Because 67 f = f o &, the definition implies 


Lxfte)= Ff fle) = Xf. 
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The proofs of (b), (c), (d), (e), and (f) are essentially the same, so we 
will prove (c) and leave the others to you. 


95 (7 @ T)o,(p) —(F @T)p 


(Lx(¢ @T))p = lim 


t—0 t 
aiat 006. (p) ® 0°76, (p) — Op & Tp 

t—0 t 
— Tim 2 20e() © FF T04(p) — 4 F64(p) © Te 

t—0 t 

Phe 906, (p) ®@ Tp — Tp @ Tp 
t—0 t 

= lim 6* 9: 70.(p) — T li 9 F0.(p) — Op 
F #55 + 7:(p) © t Si t aly 


The other parts are similar, and are left as an exercise. 


Exercise 13.4. Complete the proof of the preceding proposition. 


Corollary 13.12. If X is a smooth vector field and o is a smooth covariant 
tensor field, then £xo can be computed by the following expression: 


(Exe SYS RCO =o POMS ay a 
SGV io WE Vie DOV SMO) 


It follows that xo is smooth. 


Proof. Formula (13.10) is obtained simply by solving (13.9) for £xo, and 
replacing Lx f by Xf and LxY; by [X,Y]. It then follows immediately 
that £xo is smooth, because its action on smooth vector fields yields a 
smooth function. 


Corollary 13.13. If f ¢ C™(M), then £x(df) =d(Lxf). 
Proof. Using (13.10), we compute 


(Lxdf)(Y) = X(df(y)) — df[X, Y] 


=XYf-[X,Y]f 

= XYf-(XYf-YXf) 
oy XF 

= A(X f)(Y) 


= d(Lx f)(Y). 
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Differential Forms 


In the case of differential forms, the exterior derivative yields a much more 
powerful formula for computing Lie derivatives. Although Corollary 13.12 
gives a general formula for computing the Lie derivative of any tensor field, 
this formula has a serious drawback: In order to calculate what £xo does 
to vectors Y;,...,Y, at a point p € M, it is necessary first to extend the 
vectors to vector fields in a neighborhood of p. The formula in the next 
proposition overcomes this disadvantage. 


Proposition 13.14. For any vector field X and any differential k-form w 
on a smooth manifold M, 


Lxw =X 1(dw)+d(X Iw). (13.11) 


Proof. The proof is by induction on k. We begin with a 0-form f, in which 
case 


X (df) + dX sf) =X sdf =df(x)=Xf =Lxyf, 


which is (13.11). 

Any 1-form can be written locally as a sum of terms of the form udu 
for smooth functions u and v, so to prove (13.11) for 1-forms, it suffices 
to consider the case w = udv. In this case, using Proposition 13.11(d) and 
Corollary 13.13, the left-hand side of (13.11) reduces to 


L£x(udv) = (Lxu)du + u(Lxdv) 
= (Xu)dv+ud(Xv). 


On the other hand, using the fact that interior multiplication is an anti- 
derivation, the right-hand side is 
X 1d(udv) + d(X i(udv)) = X 1(duA dv) + d(uXv) 
= (X tdu) A du — dud (X 1dv) 
+ud(Xv) + (Xv)du 
= (Xu)du — (Xv)du+ud(Xv) + (Xv)du. 
(Remember that X 1du = du(X) = Xu, and a wedge product with a 
0-form is just ordinary multiplication.) After cancelling the two (Xv)du 
terms, this is equal to £x (udv). 
Now let k > 1, and suppose (13.11) has been proved for forms of degree 
less than k. Let w be an arbitrary k-form, written in local coordinates as 


/ F . 
w= So wre A+++A dx". 
I 
Writing a = w; dx" and B = dx” \---Adx**, we see that w can be written 


as a sum of terms of the form aA 3, where a is a 1-form and 3 is a (k— 1)- 
form. For such a term, Proposition 13.11(d) and the induction hypothesis 
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imply 
Lx(ad B) = (Lea) A B+anr (Lx) 
= (X Ida+d(X 1a)) \G+aA(X 1d64+d(X 18)). 
(13.12) 
On the other hand, using the fact that both d and ix are antiderivations, 
we compute 
X Id(aA B)+d(X 1(aA B)) 
=X 1(da\ B-—aAdB)+d((X 1a)\B-ad(X if)) 
= (X Jda)\ B+ daN(X 18) —(X Ja) Adp 
+aA(X 1dB)+d(X 1a) A B+ (X 1a) Adg 
—daA(X 18)+aAd(X 18). 


After the obvious cancellations are made, this is equal to (13.12). 


Corollary 13.15. If X is a vector field and w is a differential form, then 
Ly (dw) = d(Lxw). 
Proof. This follows from the preceding proposition and the fact that d? = 0: 
Lxdw = X 1d(dw) + d(X 1dw) 
= d(X tdw); 
d£Lxw = d(X idw+d(X 1w)) 
= d(X Jdw). 


As promised, Proposition 13.14 gives a formula for the Lie derivative 
of a differential form that can be computed easily in local coordinates, 
without having to go to the trouble of letting the form act on vector fields. 
In fact, this leads to an easy algorithm for computing Lie derivatives of 
arbitrary tensor fields, since any tensor field can be written locally as a 
linear combination of tensor products of 1-forms. This is easiest to illustrate 
with an example. 


Example 13.16. Suppose T is an arbitrary smooth symmetric 2-tensor 
field on a smooth manifold M, and let Y be a smooth vector field. We will 
compute the Lie derivative LyT in coordinates (x*). First, we observe that 
Lydx' = d(Y 1dx') +Y itd(dz') = dY". Therefore, 


LyT = Ly(T;;)dz' @ dxi + T;;(Ly dz’) ® dx) + T;; dx’ @ (Ly dz’) 
= YT;,; dx' + T;; dY* ® dx’ + Ti; dx’ @ dY/ 


oy" oyr 
— ez + Tik=> + Tix a) dx’ @ dx’. 


Ox? Oxi 
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What is the meaning of the Lie derivative of a tensor field with respect to 
a vector field X? We have already seen that the Lie derivative of a vector 
field Y with respect to X is zero if and only if Y is invariant along the flow 
of X. It turns out that the Lie derivative of a covariant tensor field has 
exactly the same interpretation. We say that a tensor field o is invariant 
under a flow 6 if 0*0 = o on the domain of 6. 

The next lemma shows how the Lie derivative can be used to compute 
t-derivatives at times other than ¢ = 0; it is a generalization of formula 
(13.8) to tensor fields. 


Lemma 13.17. Let M be a smooth manifold, X € TJ(.M), and let @ be the 
flow of X. For any smooth covariant tensor field tT and any (to,p) in the 
domain of 6, 


d 
dt 


9 (Te.(p)) = 94,(£XT) 6%, (p)- 


t=to 


Proof. Just as in the proof of Proposition 13.9, the change of variables 
t=to+s yields 


d 
dt 


9 (Te. (p)) = (Pto-+s)* 76.445 (p) 


t=to 


(9t.)" (@s)* 70. (645 (p)) 


af (9s)"T0. (045 (p)) 
= (91)"(LvW)o,, (p)- 


Proposition 13.18. Let M be a smooth manifold and let X € J(M). A 
smooth covariant tensor field T is invariant under the flow of X if and only 
if Lxt = 0. 


Proof. Let @ denote the flow of X. If 7 is invariant under 6, then 0%7 = 7 
for all t. Inserting this into the definition of the Lie derivative, we see 
immediately that x7 = 0. 

Conversely, suppose £7 = 0. For any p € M, let D, denote the domain 
of 9), and consider the smooth curve T: D, + T*(TM) defined by 


T(t) = 9: (Te, (p))- 


Lemma 13.17 shows that T’(t) = 0 for allt € D,. Because D, is a connected 
interval containing zero, this implies that T(t) = T(0) = 7, for all t € Dp. 
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This is the same as 


9 (Te. (p)) = Tp, 


which says precisely that 7 is invariant under 0. 


Killing Fields 


Let (M,g) be a Riemannian manifold. A vector field Y on M is called a 
Killing field for g if g is invariant under the flow of Y. By Proposition 13.18, 
this is the case if and only if Lyg = 0. 

Example 13.16 applied to the case T = g gives the following coordinate 
expression for Ly g: 


0.0 oy® oy® 
(Lyq) (2. sat) =Yoij+ Ijk ae + ik ZF (13.13) 


Example 13.19 (Euclidean Killing Fields). Let g be the Euclidean 
metric on R”. In standard coordinates, the condition for a vector field to 
be a Killing field with respect to g reduces to 


oY oy? 


ax t Oxi ~° 


It is easy to check that all constant vector fields satisfy this equation, as 
do the vector fields 


3) 0 


j 
gy’ — — 73 — 
Ox Ox?’ 


which generate rotations in the (2*, x/)-plane. 


The Divergence 


For our next application, we let (M,g) be an oriented Riemannian n- 
manifold. Recall that the divergence of a smooth vector field X € T(M) is 
the smooth function div X characterized by 


(div X)dV, = d(X 1dV,). 


Now we can give a geometric interpretation to the divergence, which ex- 
plains the choice of the term “divergence.” Observe that formula (13.11) 
for the Lie derivative of a differential form implies 


LxdV, = X 1d(dV,) + d(X 1dV,) = (div X)dV5, 


because the exterior derivative of any n-form on an n-manifold is zero. 
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A flow 6 on M is said to be volume preserving if for every compact 
domain of integration D C M and every t € R such that D is contained in 
the domain of @;, Vol(@:(D)) = Vol(D). It is volume increasing if for any 
such D with positive volume, Vol(@;(D)) is a strictly increasing function of 
t, and volume decreasing if it is strictly decreasing. Note that the properties 
of flow domains ensure that, if D is contained in the domain of 0; for some 
t, then the same is true for all times between 0 and t. The next proposition 
shows that the divergence of a vector field is a quantitative measure of the 
tendency of its flow to “spread out” or diverge. 


Proposition 13.20. Let M be an oriented Riemannian manifold and let 
XeET(M). 


(a) The flow of X is volume preserving if and only if div X = 0. 


(b) If div X > 0, then the flow of X is volume increasing, and if div X < 
0, then it is volume decreasing. 


Proof. Let @ be the flow of X, and for each t let M; be the domain of 6;. 
If D is a compact domain of integration contained in M;, then 


Vol(0,(D)) = | dV, = | 0% dV5. 
6:(D) D 


Because the integrand is a smooth function of t, we can differentiate this ex- 
pression with respect to t by differentiating under the integral sign. (Strictly 
speaking, we should use a partition of unity to express the integral as a sum 
of integrals over domains in R”, and then differentiate under the integral 
signs there. The details are left to you.) Using Lemma 13.17, we obtain 


d 6) 
di Lip Vol(6;(D)) = i OL es 


= f etexen) 
D 


(A dV) 


= if 6% ((div X)dV,) 
D 


=f, (div X)dV,. 
949 (D) 


It follows that div X = 0 implies that Vol(@;(D)) is a constant function 
of t, while div X > 0 or div X < 0 implies that it is strictly increasing or 
strictly decreasing, respectively. 

Now assume that 6 is volume preserving. If div X #4 0 at some point 
p € M, then there is some open set U containing p on which div X does 
not change sign. If div X > 0 on U, then X generates a volume increasing 
flow on U by the argument above. In particular, for any coordinate ball 
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B such that B C U and any t > 0 sufficiently small that @;(B) Cc U, we 
have Vol(#:(B)) > Vol(B), which contradicts the assumption that 0 is vol- 
ume preserving. The argument in the case div X < 0 is exactly analogous. 
Therefore div X = 0. 


Symplectic Manifolds 


Let (M,w) be a symplectic manifold. (Recall that this means a smooth 
manifold M endowed with a symplectic form w, which is a closed nonde- 
generate 2-form.) One of the most important constructions on symplec- 
tic manifolds is a symplectic analogue of the gradient, defined as follows. 
Because of the nondegeneracy of w, the bundle map w: TM — T*M 
given by @(X)(Y) = w(X,Y) is an isomorphism. For any smooth func- 
tion f € C™(M), we define the Hamiltonian vector field of f to be the 
vector field Xf defined by 


X;=o" (df), 
so Xf is characterized by 
w(X-,Y) =df(Y)=Yf 
for any vector field Y. Another way to write this is 
X- iw = df. 


Example 13.21. On R?” with the standard symplectic form w = 
yo, dv’ A dy’, Xf can be computed explicitly as follows. Writing 


for some coefficient functions (A’, B’) to be determined, we compute 
CaS aca se ade! Ady’ 
GA OxI Oys a 
= oS At dy’ — B? dz’. 
i=1 
(When working with the standard symplectic form, like the Euclidean met- 


ric, it is usually necessary to insert explicit summation signs.) On the other 
hand, 


Ohh WO a8 
df =o awit + ayitu 
i=1 


Applications 347 


Setting these two expressions equal to each other, we find that A’ = Of /Oy' 
and Bt = —Of /dx", which yields the following formula for the Hamiltonian 
vector field of f: 


Gof do af a 
Ap= Do aioe Be Bae (13.14) 


Although the definition of the Hamiltonian vector field is formally analo- 
gous to that of the gradient on a Riemannian manifold, Hamiltonian vector 
fields differ from gradients in some very significant ways, as the next lemma 
shows. 


Proposition 13.22 (Properties of Hamiltonian Vector Fields). 
Let (M,w) be a symplectic manifold and let f € C™@(M). 


(a) f is constant along the flow of Xf, i.e., if 0 is the flow, then fo6;(p) = 
f(p) for all (t,p) in the domain of 0. 


(b) At each regular point of f, the Hamiltonian vector field X is tangent 
to the level set of f. 


Proof. Both assertions follow from the fact that 


Xpf = df(Xp) = 0(X7, Xp) =0 


because w is alternating. 


A vector field X on M is said to be symplectic if w is invariant under the 
flow of X. It is said to be Hamiltonian (or globally Hamiltonian) if there 
exists a smooth function f such that X = Xy, and locally Hamiltonian 
if every point p has a neighborhood on which X is Hamiltonian. Clearly 
every globally Hamiltonian vector field is locally Hamiltonian. 


Proposition 13.23 (Hamiltonian and Symplectic Vector Fields). 
Let (M,w) be a symplectic manifold. A smooth vector field on M is sym- 
plectic if and only if it ts locally Hamiltonian. Every locally Hamiltonian 
vector field on M is globally Hamiltonian if and only if Hjp(M) = 0. 


Proof. By Proposition 13.18, a vector field X is symplectic if and only if 
Lxw = 0. Using formula (13.11) for the Lie derivative of a differential form, 
we compute 


Lxw =d(X Iw) + X 1(dw) = d(X Iw). (13.15) 


Therefore X is symplectic if and only if the 1-form X Iw is closed. On the 
one hand, if X is locally Hamiltonian, then in a neighborhood of each point 
there is a function f such that X = Xf, so X lw = Xf 1w = df, which 
is certainly closed. Conversely, if X is symplectic, then by the Poincaré 
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lemma each point p € M has a neighborhood U on which the closed 1-form 
X lw is exact. This means there is a smooth function f defined on U such 
that X 1w = df, which means X = Xy on U. 

Now suppose Hjp(M) = 0. Then every closed form is exact, so for any 
locally Hamiltonian (hence symplectic) vector field X there is a smooth 
function f such that X Iw = df. This means that X = Xf, so X is globally 
Hamiltonian. Conversely, suppose every locally Hamiltonian vector field is 
globally Hamiltonian. Let 7 be a closed 1-form, and let X be the vector 
field X = ©~1n. Then (13.15) shows that Lxw = 0, so X is symplectic 
and therefore locally Hamiltonian. By hypothesis, there is a global smooth 
function f such that X = Xy, and then unwinding the definitions, we find 
that 7 = df. 


Using Hamiltonian vector fields, we define an operation on functions 
similar to the Lie bracket of vector fields. Given f,g € C™(M), we define 
their Poisson bracket {f,g} € C°(M) by 


{fg} = Xpg = w(Xq, Xz). 
Two functions are said to Poisson commute if their Poisson bracket is zero. 


Example 13.24. Using the result of Example 13.21, we can easily com- 
pute the Poisson bracket of two functions f,g on R?”: 


_ Of Og | Of dg 


Proposition 13.25 (Properties of the Poisson Bracket). Let 
(M,w) be a symplectic manifold, and f,g © C®@(M). 


(b) g is constant along the flow of X¥ if and only if {f,g} =0. 


(c) X¢p.g} = [Xp XQ]. 


Proof. Part (a) is evident from the characterization {f,g} = w(X,,XyF), 
and (b) from {f, 9} = Xyg. The proof of (c) is a computation using Propo- 
sition 13.11(e): 


dt f, g} = d(X yg) 
= d(£x,9) 
=Lx,dg 
=Lx,(Xg 4) 
= (Lx,Xg) w+ Xg41hx,w 
= [X>,X5] tw, 


which is equivalent to (c). 
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Our next theorem, called the Darboux theorem, is central in the theory 
of symplectic structures. It is a nonlinear analogue of the canonical form 
for a symplectic tensor given in Proposition 9.17. 


Theorem 13.26 (Darboux). Let (M,w) be a 2n-dimensional symplectic 
manifold. Near every point p € M, there are coordinates (x, y',...,2”",y”) 
in which w is given by 


n 
w= So dx A dy’. (13.17) 

i=1 
Any coordinates satisfying the conclusion of the Darboux theorem are 
called Darboux coordinates, symplectic coordinates, or canonical coordi- 
nates. Before we begin the proof, let us prove the following lemma, which 
shows that Darboux coordinates are characterized by the Poisson brackets 

of the coordinate functions. 


Lemma 13.27. Let (M,w) be a symplectic manifold. Coordinates (x*, y*) 
on an open set U C M are Darbouz coordinates if and only if their Poisson 
brackets satisfy 


{2,27} ={y,y}=0; {2',y?} =—6%. (13.18) 
Proof. One direction is easy: If w is given by (13.17), then formula 
(13.16) shows that the Poisson brackets of the coordinate functions sat- 


isfy (13.18). Conversely, If (x*, y’) are coordinates whose Poisson brackets 
satisfy (13.18), then the Hamiltonian fields of the coordinates satisfy 


ded (Xp) = Xqi(a!) = {0',21} =0, 


dy? (X,,) = Xyi (y’) = {z", yt = =O 
dx} (Xyi) = Xy(2?) = {y', 27} = 6 
dy? (Xy:) = Xyi (y’) =, fy", y"} = 
This implies that 
0 0 
XS X= 
* Oy’ Y Ont 


If we write 
w= Ajjdax" A dad + B,jdax' A dy? + Cijdy’ x dy’, 
with Aj; = —A,; and Ci; = —Cj;, then the coefficients are determined by 


a 9 . 
ae (a sai) = 70(Xyi,Xyi) = atu" y'} = 0. 


0 0 et is ss 
oe —— i j= - Jal — §, 
Bi; a o(S i) w(Xy Xy ) {x sy} 0) 


a a = 
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This shows that w has the form (13.17). 


Proof of the Darbouz Theorem. We will show by induction on k that for 
each k = 0,...,n there are functions (x!,y',...,2*,y”) satisfying (13.18) 
near p such that {dz!,dy',...,dax*, dy*} are independent at p. When k = n, 
this proves the theorem. 

For n = O, there is nothing to prove. So suppose we have such 
(x!,y',...,2*,y*). Observe that the Hamiltonian vector field of any con- 
stant function is zero. Therefore, because all of the Poisson brackets of the 
coordinates are constants, the Hamiltonian vector fields of the coordinates 
satisfy 


Because ©: T,M — T* M is an isomorphism and the differentials {dx’, dy'} 
are independent, these Hamiltonian vector fields are all independent in a 
neighborhood of p. Thus by the normal form theorem for commuting vector 


fields, there are coordinates (ut,...,u?") near p such that 
0 a) 
Let 2*+! = u?*+! in this coordinate system. Then x*+! Poisson commutes 
* « y' 
with x’ and y’ fori =1,...,k, because 
t k+1 k+1 0 2k+1 
{x', aT" } = Xpi(x )=a,% = 0, 
x (13.20) 
{y',a*} = Xy(e**) = Autte yrkrl — 9. 
The restriction of w to span{X,zi, Xyi,i = 1,...,k} is nondegenerate, as 


can easily be verified by computing the action of w on these vectors using 
(13.18). Therefore, X,.+1 is independent of {X,:, X,yi,i = 1,...,k} because 
(13.20) implies w(Xpr+1, Xz) = w(Xgrti, Xyi) = 0. Just as before, the 
2k +1 Hamiltonian vector fields of the functions (x', y1,...,2*,y*,2**+1) 
all commute, so we can find new coordinates (v’) such that 


0 
Ov 
fe) 


Oyitktl Xyi, 


=X,, i=1,...,k4+1, 
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Finally, let y**+! = —v**! in these coordinates. The independence condition 
is satisfied as before, and we compute 


Tk O 
nes uF oh = Hoaagr = Ovkt1 (— ase = 
O 
{ai y**1} ea as = ut 0, Ne lt, Lk, 
Ov 
er a 
{yy} = Hy = sae = 0, Gade k 


This completes the inductive step. 


This theorem was first proved in 1882 by Darboux. A much more elegant 
proof was discovered in the 1960s by Jiirgen Moser [Mos65] and Alan We- 
instein [Wei69]. It requires a bit more machinery than we have developed, 
but you can look it up, for example, in [Wei77] or [AM78]. 

The theory of symplectic manifolds is central to the study of classical 
mechanics. Many classical dynamical systems moving under the influence 
of Newton’s laws of motion can be modeled naturally as the flow of a 
Hamiltonian vector field on a symplectic manifold. If one can find one or 
more functions that Poisson commute with the Hamiltonian, then they 
must be constant along the flow, so by restricting attention to a common 
level set of these functions one can often reduce the problem to one with 
fewer degrees of freedom. For much more on these ideas, see [AM78]. 
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Problems 


13-1. 


13-2. 


13-3. 


13-4. 
13-5. 


13-6. 


13-7. 


Let V,W, X € T(M) and f,g € C™~(M). Show that 


(a) [fV,gW] = folV,W] + f(Vg)W — g(Wf)V. 
(b) Sv (fW) = (VA)W + fhvW. 
(c) Liv.w)x = LyhLwx — LwhyX. 


Let V and W be the vector fields of Exercise 13.2(b). Compute the 
flows 0,7 of V and W, and verify that they do not commute by 
finding explicit times s and ¢ such that 0,0, 4,0 Os. 


Give an example of vector fields V, V, and W on R? such that V = 
V = 0/0x along the z-axis but LyW 4 £5W at the origin. [This 
shows that it is really necessary to know the vector field V to compute 
(LyW),; it is not sufficient just to know the vector V,, or even to 
know the values of V along an integral curve of V.] 


Determine all Killing fields on (R”, 9). 


Let M be a smooth manifold and w a 1-form on M. Show that for 
any smooth vector fields X,Y, 


du(X,Y) = X(w(Y)) — Y(w(X)) — o [X,Y]. 
Generalize the result of Problem 13-5 to a k-form w by showing that 


du(X1,...,Xk+1) = oS (-1)1Xj(w(X1,..., Xi,..-, Xe41)) 
1<i<k+1 


+ S- (H1) (XG Mayer Navy Ri Rea) 
1<i<j<kt1 


where the hats indicate omitted arguments. [This formula can be used 
to give a coordinate-free definition of the exterior derivative. However, 
this definition has the serious flaw that, in order to compute the action 
of dw on vectors (X1,..., X%) at a point p € M, one must first extend 
them to vector fields in a neighborhood of p. It is not evident from 
this formula that the resulting value is independent of the extensions 
chosen. ] 


For each k-tuple of vector fields on R*® shown below, either find coor- 
dinates (u!, u?, u3) in a neighborhood of (1,0,0) such that V; = 0/du* 
for 7 =1,...,k, or explain whey there are none. 


) ) 
(a) k=1,; Vata ot aa 
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re) 0 0 
k=2; => — ——, 
(b) MS a aaa 3y 


) ) ) 3) 
= 2: = —_—_ — —_ => i 
(c) k= 2; Vi "5, Ya _> Ve oon Tay 


) 3) ) 3) 


6) 
(d) k=3; Vi=az- — V3 =2z>---2“—. 


By Yaa? 2 Yaz By? 


13-8. Let (W,w) be a symplectic manifold. Show that the Poisson bracket 
satisfies the following identity for all f,g,h € C@(M): 


{tf g},h} a {19 h}, f} an {it hgh = 0. 
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14 


Integral Manifolds and Foliations 


Suppose V is a nonvanishing vector field on a manifold M. The results 
of Chapter 12 tell us that each integral curve of V is an immersion, and 
that locally the images of the integral curves fit together nicely like parallel 
lines in Euclidean space. In particular, the fundamental theorem on flows 
tells us that these curves are determined by the knowledge of their tangent 
vectors. 

In this chapter we explore an important generalization of this idea to 
higher-dimensional submanifolds. The general setup is this: Suppose we 
are given a k-dimensional subspace of T,,M at each point p € M, varying 
smoothly from point to point. (Such a collection of subspaces is called a 
“tangent distribution.” ) Is there a k-dimensional submanifold (called an 
“integral manifold” of the tangent distribution) whose tangent space at 
each point is the given subspace? The answer in this case is more compli- 
cated than in the case of vector fields: There is a necessary condition, called 
involutivity, that must be satisfied by the tangent distribution. The main 
theorem of this chapter, the Frobenius theorem, tells us that this condition 
is also sufficient. 

We will prove the Frobenius theorem in two forms: a local form, which 
says that a neighborhood of every point is filled up with integral manifolds, 
fitting together nicely like parallel subspaces of R”, and a global form, which 
says that the entire manifold is the disjoint union of immersed integral 
manifolds. 
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Tangent Distributions 


Let M be a smooth manifold. Given a smooth vector bundle 7: FE — M,a 
(smooth) subbundle of E is a subset D C E with the following properties: 


(i) D is an embedded submanifold of E. 


(ii) For each p € M, the fiber D, = DM x~1(p) is a linear subspace of 
Ren Tp). 


(iii) With the vector space structure on each D, inherited from E, and 
the projection 7|p: D — M, D is a smooth vector bundle over M. 


Note that the condition that D be a vector bundle implies that the projec- 
tion |p: D — M must be surjective, and that all the fibers D, must have 
the same dimension. 


Exercise 14.1. If DC FE is a smooth subbundle, show that the inclusion 
map lt: D => E is a bundle map. 


A subbundle of TM is called a tangent distribution on M, or just a 
distribution if there is no opportunity for confusion with the use of the term 
“distribution” for generalized functions in analysis. Other common names 
for distributions are tangent subbundles or plane fields. The dimension of 
each fiber of D is called dimension of the distribution. 

The following lemma gives a convenient condition for checking that a 
collection of subspaces {Dp C T,M : p € M} is a distribution. 


Lemma 14.1. Let M be a smooth manifold, and suppose for each p © 
M we are given a k-dimensional linear subspace Dy, C T,M. Then D = 
Lenz D, C TM is a distribution if and only if the following condition is 
satisfied: 


Each point p € M has a neighborhood U on which there 
are smooth vector fields Y,,...,Y~,: U > TM such that (14.1) 
Yilp,---, Yelp form a basis for D, at each p € U. 


Proof. If D is a distribution, then by definition any p € M has a neighbor- 
hood U over which there exists a local trivialization of D, and by Problem 
3-5 there exists a smooth local frame for D over any such set U. Such a local 
frame is by definition a collection of smooth sections o1,...,0,: U — D 
whose images form a basis for D, at each point p € U. The smooth vector 
fields we seek are given by Y; = 100,;, where 1: D — TM is inclusion. 

Conversely, suppose that D satisfies (14.1). Condition (ii) in the defi- 
nition of a subbundle is true by hypothesis, so we need to show that D 
satisfies conditions (i) and (iii). 

To prove that D is an embedded submanifold, it suffices to show that 
each point p € M has a neighborhood U such that DM a~1(U) is an 
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embedded submanifold of s~!(U) C TM. Given p € M, let Yi,...,Yx 
be vector fields defined on a neighborhood of p and satisfying (14.1). The 
independent vectors Yj|p,..., Yg|p can be extended to a basis Yi|p,..-, Yn|p 
for T,M, and then Yz41|p,---,¥Yn|p can be extended to vector fields in a 
neighborhood of p. By continuity, they will still be independent in some 
neighborhood U of p, so they form a local frame for TM over U. By Problem 
3-5 again, this yields a local trivialization s~!(U) — U x R” defined by 


¥YVilat (a (y7s---5y"))- 


In terms of this trivialization, DM a~!(U) corresponds to U x R® = 
{(q,(y!,...,y*,0,...,0))} C Ux R”, which is obviously a regular subman- 
ifold. Moreover, the map ®|pqz-1(v): DN z~1(U) > U x R* is obviously 
a local trivialization of D, showing that D is itself a vector bundle. 


In the situation of the preceding lemma, we say D is the distribution 
(locally) spanned by the vector fields Yi,..., Yx. 


Integral Manifolds and Involutivity 


Suppose D Cc TM is a distribution. An immersed submanifold N Cc M is 
called an integral manifold of D if T,N = D, at each point p € N. The 
main question we want to address in this chapter is the existence of integral 
manifolds. 

Before we proceed with the general theory, let us describe some examples 
of distributions and integral manifolds that you should keep in mind. 


Example 14.2 (Tangent distributions). 


(a) If V is any nowhere-vanishing vector field on a manifold M, then 
V spans a 1-dimensional distribution on M (i.e., Dp = span(V,) for 
each p € M). The smoothness criterion (14.1) is obviously satisfied. 
The image of any integral curve of V is an integral manifold of D. 


(b) In R”, the vector fields 0/0x!,...,0/Ox* span a k-dimensional distri- 
bution. The k-dimensional affine subspaces parallel to R” are integral 
manifolds. 


(c) Define a distribution on R? \ {0} by letting D, be the tangent space 
to the sphere through p and centered at 0. Away from the north 
and south poles, D is locally spanned by the coordinate vector fields 
0/00 and 0/0¢ in spherical coordinates; near the poles, we can use 
spherical coordinates composed with a suitable rotation. Thus D is 
a 2-dimensional distribution on R? \ {0}. Through each point p € 
R® \ {0}, the sphere of radius |p| around 0 is an integral manifold. 
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(d) Let X and Y be the following vector fields on R?: 


() ) ) 
PP at By eee 

and let D be the distribution spanned by X and Y. It turns out that 
D has no integral manifolds. To get an idea why, suppose N is an 
integral manifold through the origin. Because X and Y are tangent 
to N, any integral curve of X or Y that starts in N will have to 
stay in N, at least for a short time. Thus N contains an open subset 
of the z-axis (which is an integral curve of X). It also contains, for 
each sufficiently small x, an open subset of the line parallel to the 
y-axis and passing through (a,0,0) (which is an integral curve of Y). 
Therefore N contains an open subset of the (a, y)-plane. However, 
the tangent planes to points of the (x, y)-plane at points off of the 
x-axis are not contained in D. Therefore, no such integral manifold 
exists. 


The last example shows that, in general, integral manifolds may fail to 
exist. The reason for this failure is expressed in the following proposition. 
We say that a tangent distribution D is involutive if given any pair of local 
sections of D (i.e., vector fields X, Y defined on an open subset of M such 
that X,, Y, € D, for each p), their Lie bracket is also a section of D. We say 
D is integrable if through each point of M there exists an integral manifold 
of D. 


Proposition 14.3. Every integrable distribution is involutive. 


Proof. Suppose X and Y are local sections of D defined on some open 
subset U C M. Let p be any point in U, and let N be an integral manifold 
of D passing through p. The fact that X and Y are sections of D means 
that X and Y are tangent to N. By Proposition 13.4, [X, Y] is also tangent 
to N, and therefore [X,Y], € Dp. 


The next lemma shows that the involutivity condition does not have to 
be checked for every pair of vector fields, just those of a local frame near 
each point. 


Lemma 14.4. Let DC TM be a distribution. If in a neighborhood of every 
point of M there exists a local frame (Vi,...,Vx) for D such that [V;, Vj] 
is a section of D for each i,j =1,...,k, then D is involutive. 


Proof. Suppose the hypothesis holds, and suppose X and Y are sections 
of D over some open subset U C M. Given p € M, choose a local frame 
(Vi,..-,; Ve) satisfying the hypothesis in a neighborhood of p, and write 
X = X'V, and Y = Y'YV;. Then (using the result of Problem 13-1), 
[X,Y] = [X*V;, Y7Vj] 
= X*YI1V,, Vil + X*(VY)V; — YF (Vj XV. 
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It follows from the hypothesis that this last expression is a section of D. 


The Frobenius Theorem 


In Example 14.2, all of the tangent distributions we defined except the last 
one had the property that there was an integral manifold through each 
point. Moreover, these submanifolds all “fit together” nicely like parallel 
affine subspaces of R”. Given a k-dimensional distribution D C TM, let 
us say that a coordinate chart (U,y) on M is flat for D if at points of U, 
D is spanned by the first k coordinate vector fields 0/0z!,...,0/0x*. It is 
obvious that each slice of the form #*+! = c*t!, ... , x” =c” for constants 
ck+1... ce” is an integral manifold of D. This is the nicest possible local 
situation for integral manifolds We say that a distribution D C TM is 
completely integrable if there exists a flat chart for D in a neighborhood 
of every point of M. Obviously every completely integrable distribution is 
integrable and therefore involutive. 

The next theorem is the main result of this chapter, and indeed one 
of the central theorems in smooth manifold theory. (The others are the 
inverse function theorem, the fundamental theorem on flows, and Stokes’s 
theorem.) 


Theorem 14.5 (Frobenius). Every involutive distribution is completely 
integrable. 


Proof. The canonical form theorem for commuting vector fields (Theorem 
13.10) implies that any distribution locally spanned by commuting vector 
fields is completely integrable. Thus it suffices to show that any involutive 
distribution is locally spanned by commuting vector fields. 

Let D be a k-dimensional involutive distribution on an n-dimensional 
manifold M. Given p € M, choose a neighborhood U of p on which there 
exist coordinates (x',...,2”) centered at p and a local frame Yj,..., Yx for 
D. By a linear change of coordinates, we may assume that 0/0xz'|, = Yilp. 

Let I: U — R* be the smooth map whose coordinate representation is 
the projection onto the first k coordinates: H(a!,...,2") = (#!,...,2*). 
This induces a smooth map II, : TU — TR*, which can be written 


k 


ee, . O 
(Se 2] y= a 
i=l Ox q i=l Ox I(q) 
(Notice that the summation is only over i = 1,..., & on the right-hand side.) 


Clearly, the restriction of II, to Dp C T,M is an isomorphism, and thus by 
continuity the same is true of IL,: Dg — Trq)R* for gq in a neighborhood 
of p. The matrix entries of II,|p with respect to the frames {Y;|,} and 
{0/0x1|11(q) } are smooth functions of q, and thus so are the matrix entries 
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of (l.|p,) : Ty(q)R* — Dg. Define a new local frame X,,...,X, for D 
near p by 


0 


Xilq = (H|p,) dat i 


(14.2) 
a) 
The theorem will be proved if we can show that [X;,X,] = 0 for all i, j. 

First observe that X; and 0/0x* are U-related, because (14.2) implies 
that 


0 


Oxi = (lp.) Maly Me NG |e 


II(q) 


Therefore, by the naturality of Lie brackets, 


0 0 


a a — 0. 
Ox* ali 


TL. [Xi, X5]q = | 


Since [X;,.X,] takes its values in D by involutivity, and IL, is injective on 
D, this implies that [X;, Xj], = 0 for each q, thus completing the proof. 


One immediate consequences of the theorem is the following lemma, 
which we will use repeatedly. 


Proposition 14.6 (Local Structure of Integral Manifolds). Let D 
be an involutive k-dimensional distribution on a smooth manifold M, and 
let (U,y) be a flat chart for D. If N is any integral manifold of D, then 
NOU is a countable disjoint union of open subsets of k-dimensional slices 
of U, each of which is open in N and embedded in M. 


Proof. Because the inclusion map 1: N — M is continuous, NOU = .~!(U) 
is open in N, and thus consists of a countable disjoint union of connected 
components, each of which is open in N. 

Let V be any component of NOU. We will show first that V is contained 
in a single slice. Choosing some p € V, it suffices to show that x(q) = x*(p) 
fori = k+1,...,n for any gq € V. Since V is connected, there exists a 
path y in V from p to q, which we may take to be smooth by Problem 
6-1. Because y lies in V and V is an integral manifold of D, we have 
y(t) € Tya)V = D5). Because D is spanned by 0/0z1,...,0/dx* in U, 
this implies that the last n — k components of y/(t) are zero: (y’)/(t) = 0 
for i > k+1. Since the domain of y is connected, this means 7’ is constant 
for these values of i, so p and q lie in a single slice S. 

Because S is embedded in M, the inclusion map V — M is also smooth 
as a map into S' by Corollary 5.38. The inclusion V — S'is thus an injective 
immersion between manifolds of the same dimension, and therefore a local 
diffeomorphism, an open map, and a homeomorphism onto an open subset 
of S. 
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This local characterization of integral manifolds implies the following 
strengthening of our theorem about restricting the range of a smooth map. 
(This result will be used in Chapter 15.) 


Proposition 14.7. Suppose H C N is an integral manifold of an invo- 
lutive distribution D on N. If F: M — N is a smooth map such that 
F(M) CH, then F is smooth as a map from M to H. 


Proof. Let p € M be arbitrary, and set q = F(p) € H. Let (y',...,y") 
be flat coordinates for D on a neighborhood U of g. Choose coordinates 
(x) for M on a connected neighborhood B of p. Writing the coordinate 
representation of F' as 


(y',...59") = (Fi (z),...,F"(2)), 


the fact that F(B) C H means that the coordinate functions F*t!,...,F” 
take on only countably many values. Because B is connected, the intermedi- 
ate value theorem implies that these coordinate functions are constant, and 
thus F(B) lies in a single slice. On this slice, (y',...,y*) are coordinates 
for H, so F: N — H has the local coordinate representation 


(yryesy") = (F'@))- 3 F*@), 


which is smooth. 


Applications 


In the next chapter, we will see some geometric applications of the Frobe- 
nius theorem. In this chapter, we concentrate on some applications to the 
study of partial differential equations (PDEs). 

Our first application is not so much an application as a simple rephrasing 
of the theorem. Because explicitly finding integral manifolds boils down to 
solving a system of PDEs, we can interpret the Frobenius theorem as an 
existence and uniqueness result for such equations. 


Suppose we are given n — 1 smooth vector fields Yj,..., Y¥,—1 on an open 
subset of R”, and suppose we seek a smooth function f that satisfies the 
system of equations Y, f = --- Y,-1f = 0. Away from the critical points of 


f, the level sets of f will be integral manifolds of the distribution spanned 
by the Y;’s. Thus the involutivity of this distribution is a necessary and 
sufficient condition for the existence of a solution f on any open set. 


Example 14.8. Consider the system 


of of 
2 | = 
a5, + 205, = 9 (14.3) 


+ 2a = 0, 3 
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of PDEs for a real-valued function f of three variables. It is an example of a 
linear, first-order system of PDEs for f (linear because the left-hand sides 
of (14.3) depend linearly on f, and first-order because it involves only first 
derivatives of f). It is also overdetermined, because there are more equations 
than unknown functions. In general, overdetermined systems have solutions 
only if they satisfy certain compatibility conditions; in this case, involutivity 
is the key. 
To see whether (14.3) has nonconstant solutions, we let X and Y denote 
the vector fields 
) 


6) 6) 
X = -222?— 4+ 2¢— Y = -32? Qy—. 
x aa "Oz? ° Oy Vaz 


These span a 2-dimensional distribution D on the subset U of R? where 
z #0, as you can easily check. By direct computation, if z 4 0, 


6) O Ag Ay 
X,Y|=-12 = Ys X 
[X, Y] a, 8yz =F ae 


which shows that D is involutive on U. Therefore, on a neighborhood of 
any point (20, yo, Zo) with zo 4 0, there are flat coordinates (u,v, w) for D; 
because X and Y are tangent to the level sets of w it follows that f = w 
is a solution to (14.3) defined in a neighborhood of (20, yo, 20). In this flat 
chart, (14.3) is equivalent to Of /Ou = Of /Ov = 0, because 0/Ou and 0/dv 
also span D. Therefore, (assuming that the w-slices are connected, which 
will be the case if U is small enough), the solutions in U are precisely those 
smooth functions that depend on w alone in these coordinates, i.e., those 
of the form f(x,y,z) = g(w(a,y, z)) for some smooth function g of one 
variable. 


Exercise 14.2. What can you say about solutions in a neighborhood of a 
point (xo, yo, 0)? 


The next application is more substantial. We introduce it first in the case 
of a single function f(a, y) of two independent variables, in which case the 
notation is considerably simpler. 

Suppose we seek a solution f to the system 


of 


a y) = rade f(%,y)), 
Of (14.4) 
By 6 y) —, B(x,y, f(2,¥)), 

y 


where a and ( are smooth functions defined on some open subset U C R®. 
This is an overdetermined system of (possibly nonlinear) first-order partial 
differential equations. 
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To determine the necessary conditions that a and ( must satisfy for 
solvability of (14.4), assume f is a solution on some open set in R?. Because 
0? f /Oxdy = 0? f /OyOx, (14.4) implies 


5, (olen fey) — S (Ale,y, Foy) 


and therefore by the chain rule 


Oa | 
Oy 


This is true at any point (x,y,z) € U provided there is a smooth solution 
f with f(x,y) = z. In particular, (14.5) is a necessary condition for (14.4) 
to have a solution in a neighborhood of any point (29, yo) with arbitrary 
initial value f (xo, yo) = 20. Using the Frobenius theorem, we can show that 
this condition is sufficient. 


(14.5) 


Oa 
PS, 7 


Proposition 14.9. Suppose a and 3 are smooth functions defined on some 
open set U C R® and satisfying (14.5) there. For any (x0, yo, 20) € U, 
there is a neighborhood V of (xo, yo) in R? and a unique smooth function 
f: VOR satisfying (14.4) and f(xo0, yo) = 20. 


Proof. The idea of the proof is that the system (14.4) determines the partial 
derivatives of f in terms of its values, and therefore determines the tangent 
plane to the graph of f at each point in terms of the coordinates of the 
point on the graph. This collection of tangent planes defines a 2-dimensional 
distribution on U, and (14.5) is equivalent to the involutivity condition for 
this distribution. 

If there were a solution f on an open set V C R?, the map ©: V > R? 
given by 


B(x, y) z= (x, Y, f(x,y)) 


would be a diffeomorphism onto the graph I'(f) C R?. At any point p = 
®(z,y), the tangent space T,I(f) is spanned by the vector fields 


0 ) 


oe Ox ~ Ox|, | Ox (2,9) Oz|,’ 
(x,y) P P (14.6) 
ol 22) Fay 2 | 
oy (z,y) oy P dy 8 Pp 
The system (14.4) is satisfied if and only if 


0 0 0 
®, Seyi ot sae +a(x,y, f(x, y)) aD ’ 
Ox ( Ox Oz 
x,y) Pp Pp (14.7) 
2) =F) saeurey 2 
oy (x,y) Oy Pp a ; Oz Pp 
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Let X and Y be the vector fields 


6) 

_ ax | a(x, y, 2) 5 ’ 
0 

= at B(x, ¥, 2) a 


on U, and let D be the distribution on U spanned by X and Y. A little 
linear algebra will convince you that (14.7) holds if and only if [(f) is an 
integral manifold of D. A straightforward computation using (14.5) shows 
that [X,Y] = 0, so through each point p = (20, yo, 20) € U there is an 
integral manifold N of D. Let F: W — R be a defining function for N 
on some neighborhood W of p; for example, we could take F' to be the 
third coordinate function in a flat chart. The tangent space to N at each 
point p € N (namely D,) is equal to the kernel of F.: T,R* — T,R. Since 
0/0z|p ¢ Dy at any point p, this implies that OF /Oz # 0 at p, so by the 
implicit function theorem N is the graph of a smooth function z = f(x,y) 
in some neighborhood of p. You can verify easily that f is a solution to the 
problem. Uniqueness follows immediately from Proposition 14.6. 


There is a straightforward generalization of this result to higher dimen- 
sions. The general statement of the theorem may seem hopelessly compli- 
cated, but verifying the necessary conditions in specific examples usually 
just amounts to computing mixed partial derivatives and applying the chain 
rule. 


Proposition 14.10. Suppose a’, a= 1,...,k, 7 = 1,...,n are smooth 
functions defined on some open set U C R"** and satisfying 


day |, Oa _ Aaj, || Oak, 
+ Ap => - + Q,. 3 
Oxk Oz! ~— Ox) J Oz! 
where we use (x,z) = (x1,...,2",2z1,...,2*) to denote a point in R'+*. 


For any (xo,20) € U, there is a neighborhood V of x in R” and unique 
smooth map f: V > R* satisfying 
of’ 
Oxd 
and f(2o) = ZQ- 


Exercise 14.3. Prove Proposition 14.10. 


(fie = OG ones lies gd (e)) 


Foliations 


When we put together all the maximal integral manifolds of a k-dimensional 
involutive distribution D, we obtain a decomposition of M into k- 
dimensional submanifolds that “fit together” locally like the slices in a 
flat chart. 
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We define a foliation of dimension k on an n-manifold M to be a col- 
lection F of disjoint, connected, immersed k-dimensional submanifolds of 
M (called the leaves of the foliation), whose union is M, and such that in 
a neighborhood of each point p € M there is a chart (U,(x‘)) such that 
each leaf of the foliation intersects U in either the empty set or a countable 
union of k-dimensional slices of the form 2**+! = ck+1, ... , 2” =e". (Such 
a chart is called a flat chart for the foliation.) 


Example 14.11 (Foliations). 


(a) The collection of all k-dimensional affine subspaces of R” parallel to 
R* is a k-dimensional foliation of R”. 


(b 


mn 


The collection of all open rays of the form {Arp : A > O} is a L- 
dimensional foliation of R” \ {0}. 


(c) The collection of all spheres centered at 0 is an (n — 1)-dimensional 
foliation of R” \ {0}. 


(d) The collection of all circles of the form S! x {q} Cc T? as q ranges 
over S! yields a foliation of the torus T?. A different foliation of T? 
is given by the collection of circles of the form {p} x S! as p ranges 
over S!. 


(e) If @ is a fixed irrational number, the images of all curves of the form 
y(t) = (cost, sin t, cos(at + 8), sin(at + 0)) 


as 0 ranges over [0, 27) form a 1-dimensional foliation of the torus in 
which each leaf is dense (cf. Example 5.3 and Problem 5-4). 


(f) The collection of curves in R? satisfying one of the following equations 
is a foliation of the plane: 


y = sec(x + kr) +, keEZ,ceER; 
“us 
=-4/ le Z. 
x ath 


(g) If we rotate the curves of the previous example around the z-axis, we 
obtain a 2-dimensional foliation of R° in which some of the leaves are 
diffeomorphic to disks and some are diffeomorphic to annuli. 


The main fact about foliations is that they are in one-to-one correspon- 
dence with involutive distributions. One direction, expressed in the next 
lemma, is an easy consequence of the definition. 


Lemma 14.12. Let F be a foliation of a smooth manifold M. The collec- 
tion of tangent spaces to the leaves of F forms an involutive distribution on 


M. 
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Exercise 14.4. Prove Lemma 14.12. 


The Frobenius theorem allows us to conclude the following converse, 
which is much more profound. By the way, it is worth noting that this result 
is one of the two main reasons why the notion of immersed submanifold 
has been defined. 


Theorem 14.13 (Global Frobenius Theorem). Let D be an involu- 
tive k-dimensional tangent distribution on a smooth manifold M. The col- 
lection of all maximal connected integral manifolds of D forms a foliation 


of M. 
The theorem will be a consequence of the following lemma. 


Lemma 14.14. Let {Nahaea be any collection of connected integral man- 
ifolds of D with a point p in common. Then N = YU, Na has a unique 
smooth manifold structure making it into a connected integral manifold of 
D in which each Nq is an open submanifold. 


Proof. Define a topology on N by declaring a subset U of N to be open 
if and only if UM Nq is open in Nq for each a. It is easy to check that 
this is a topology. To prove that each Ng is open in N, we need to show 
that Na Ng is open in Ng for each @. Let g be an arbitrary point of 
Na Ng, and choose a flat chart for D on a neighborhood U of qg. Let Va, 
Vg denote the components of N.MU and Ng NU, respectively, containing 
q. By the preceding lemma, V, and Vg are open subsets of single slices with 
the subspace topology, and since both contain q, they both must lie in the 
same slice S. Thus V1 Vg is open in S and also in both N, and Ng. Since 
each q € NaM Ng has a neighborhood in Ng contained in the intersection, 
it follows that Na Ng is open in Ng as claimed. Clearly this is the unique 
topology on N with the property that each N,, is a subspace of N. 

With this topology, N is locally Euclidean of dimension k, because each 
point q € N has a Euclidean neighborhood V in some Ng, and V is an 
open subset of N because its intersection with each Ng is open in Ng by 
the argument in the preceding paragraph. Moreover, the inclusion map 
N <— M is continuous: For any open subset U C M, UN N is open in N 
because UM Ng is open in Na for each a. 

To see that N is Hausdorff, let g,q’ € N be given. There are disjoint 
open sets U,U' C M containing q and q’, respectively, and then (because 
inclusion N — M is continuous) NOU and NNU’ are disjoint open subsets 
of N containing gq. 

Next we show that N is second countable. We can cover M with count- 
ably many flat charts for D, say {W;}ien. It suffices to show that NNW; is 
contained in a countable union of slices for each 7, for then we can choose 
a countable basis for the portion of N in each such slice, and the union of 
all such bases forms a countable basis for the topology of N. 


Foliations 367 


Suppose W;, is one of these flat charts and S C W, is a slice containing 
a point gq € N. There is some connected integral manifold Ng containing 
p and q. Because connected manifolds are path connected (***appendix?), 
there is a continuous path y: [0,1] — Nq connecting p and gq. Since (0, 1] 
is compact, there exist finitely many numbers 0 = to < ty <---> <tm=1 
such that 7[t;-1,t;] is contained in one of the flat charts W;, for each 
j. Since 7[t;-1,t;] is connected, it is contained in a single component of 
W;, O Na and therefore in a single slice S;, C Wi,. 

Let us say that a slice S of some W, is accessible from p if there is a 
finite sequence of indices ig,...,%m and for each 7; a slice S;, C W;,, with 
the properties that p € S;,, S;,, = S, and $;, Si,,, # @ for each j. The 
discussion in the preceding paragraph showed that every slice that contains 
a point of N is accessible from p. To complete the proof of second count- 
ability, we just note that S;, is itself an integral manifold, and therefore 
it meets at most countably many slices of W;,,, by Proposition 14.6; thus 
there are only countably many slices accessible from p. Therefore, N is a 
topological manifold of dimension k. It is connected because it is a union 
of connected subspaces with a point in common. 

To construct a smooth structure on N, we define an atlas consisting of 
all charts of the form (SN.N,w), where S is a single slice of some flat chart, 
and w: S — R* is the map whose coordinate representation is projection 
onto the first k coordinates: a(a1,...,2”) = (a',...,2"). Because any slice 
is an embedded submanifold, its smooth structure is uniquely determined, 
and thus whenever two such slices S, S’ overlap the transition map 1’ 0 ~ 
is smooth. 

With respect to this smooth structure, the inclusion map N — M is 
an immersion (because it is an embedding on each slice), and the tangent 
space to N at each point q € N is equal to D, (because this is true for 
slices). The smooth structure is uniquely determined by the requirement 
that the inclusion N <— M be an immersion, because this implies N is 
locally embedded and thus its smooth structure must match those of the 
slices on small enough open sets. 


Proof of the global frobenius theorem. For each p € M, let Ly be the union 
of all connected integral manifolds of D containing p. By the preceding 
lemma, Ly is a connected integral manifold of D containing p, and it is 
clearly maximal. If any two such maximal integral manifolds L, and Ly 
intersect, their union L, U L, is an integral manifold containing both p 
and p’, so by maximality L, = L,. Thus the various maximal integral 
manifolds are either disjoint or identical. 

If (U, y) is any flat chart for D, then L, QU is a countable union of open 
subsets of slices by Proposition 14.6. For any such slice S, if L,MS is neither 
empty nor all of S, then L, US is a connected integral manifold properly 
containing L,, which contradicts the maximality of L,. Therefore, L,1U 
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is precisely a countable union of slices, so the collection {L, : p € M} is 
the desired foliation. 


Exercise 14.5. If M and N are smooth manifolds, show that TM and TN 
define integrable distributions on M x N, whose leaves are diffeomorphic to 
M and N, respectively. 
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Problems 


14-1. 


14-2. 


14-3. 


14-4. 


If G is a connected Lie group acting smoothly, freely, and properly 
on a smooth manifold M, show that the orbits of G form a foliation 
of M. 


Let D be the distribution on R® spanned by 


0 O O 
aa Msghyiees. © Ayr eas 
On| 8 az’ Oy 


x= 


(a) Find an integral submanifold of D passing through the origin. 
(b) Is D involutive? Explain your answer in light of part (a). 


Of the systems of partial differential equations below, determine 
which ones have solutions z(x, y) (or, for part (c), z(a, y) and w(z, y)) 
in a neighborhood of the origin for arbitrary positive values of z(0,0) 
(respectively, z(0,0) and w(0,0)). 


(a) Oe COS Y; Of a l ta 

a oat SY; i zlog ztany. 
Oz Oz 
_- = et. nee ee Ye 

(b) ano ay = ee 

wo) an ay, iy, OW 
Ox , Oy’ Ox , Oy 


This problem outlines an alternative characterization of involutivity 
in terms of differential forms. Suppose D is a k-dimensional tangent 
distribution on an n-manifold M. 


(a) Show that near any point of M there are n — k independent 
1-forms w!,...,w"~* such that X € D if and only if w'(X) =0 


fori =1,...,k. 

(b) Show that D is involutive if and only if whenever (w!,...,w”~*) 
are forms as above, there are 1-forms {a :i,j =1,...,n—k} 
such that 


k 
dw’ = y ay Aw!, 
j=l 


(Hint: Use Problem 13-5.] 


Let A*(M) denote the vector space A°(M)@---6A"(M). With 
the wedge product, A*(M) is an associative ring. Show that the 
set 


— 
fo) 
wo 


J(D) = {w € A*(M) : w|p = 0} 


is an ideal in A*(M). 
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(d) An ideal J is said to be a differential ideal if d(J) C J, that is, if 
whenever w € J, dw € J as well. Show that D is involutive if and 
only if J(D) is a differential ideal. 


15 
Lie Algebras and Lie Groups 


The set of vector fields on a Lie group that are invariant under all left 
translations forms a finite-dimensional vector space, which carries a nat- 
ural bilinear product making it into an algebraic structure known as a Lie 
algebra. Many of the properties of a Lie group are reflected in the algebraic 
structure of its Lie algebra. 

In this chapter, we introduce the definition of an abstract Lie algebra, de- 
fine the Lie algebra of a Lie group, and explore some of its important prop- 
erties, including the relationships among Lie algebras, homomorphisms of 
Lie groups, and one-parameter subgroups of Lie groups. Later we introduce 
the exponential map, a smooth map from the Lie algebra into the group 
that shows in a very explicit way how the group structure near the identity 
is reflected in the algebraic structure of the Lie algebra. The culmination 
of the chapter is a complete description of the fundamental correspondence 
between Lie groups and Lie algebras: There is a one-to-one correspondence 
between finite-dimensional Lie algebras and simply-connected Lie groups, 
and all of the connected Lie groups with a given Lie algebra are quotients 
of the simply connected one by discrete normal subgroups. 


Lie Algebras 


A Lie algebra is a real vector space b endowed with a bilinear map bxb — 6, 
denoted by (X,Y) + [X,Y] and called the bracket of X and Y, satisfying 
the following two properties for all X,Y, Z € b: 
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(i) ANTISYMMETRY: [X,Y] = —[Y, X]. 
(ii) JAcoBI IDENTITY: [X, [Y, Z]] + [Y, [Z, X]] + [Z, [X, Y]] = 0. 


Notice that the Jacobi identity is a substitute for associativity, which does 
not hold in general for brackets in a Lie algebra. 


Example 15.1 (Lie algebras). 


(a) The vector space M(n,R) of n x n real matrices is an n-dimensional 
Lie algebra under the commutator bracket 


[A, B] = AB— BA. 


Antisymmetry is obvious from the definition, and the Jacobi iden- 
tity follows from a straightforward calculation. When we are thinking 
of M(n,R) as a Lie algebra with this bracket, we will denote it by 
gl(n,R). 


(b) Similarly, gl(n,C) is the 2n?-dimensional (real) Lie algebra obtained 
by endowing M(n, C) with the commutator bracket. 


(c) The space J(M) of all smooth vector fields on a smooth manifold 
M is an infinite-dimensional Lie algebra under the Lie bracket by 
Lemma 13.1. 


(d) If (M,w) is a symplectic manifold, the space C®°(M) becomes a Lie 
algebra under the Poisson bracket. Problem 13-8 shows that this 
bracket satisfies the Jacobi identity. 


(e) Any vector space V becomes a Lie algebra if we define all brack- 
ets to be zero. Such a Lie algebra is said to be abelian. (The name 
refers to the fact that the bracket in most Lie algebras, as in the two 
preceding examples, is defined as a commutator operation in terms 
of an underlying associative product; so “abelian” refers to the fact 
that all brackets are zero precisely when the underlying product is 
commutative.) 


(f) If g and § are Lie algebras, then g x h is a Lie algebra with the bracket 
operation defined by 


(X,Y), (5 Y)] = ((X, X’), [YY]. 
With this bracket, g x § is called a product Lie algebra. 


If 6 is a Lie algebra, a linear subspace a C 6 is called a Lie subalgebra of 
b if it is closed under Lie brackets. In this case a is itself a Lie algebra with 
the same bracket operation. 

If a and 6 are Lie algebras, a linear map A: a — 6 is called a Lie algebra 
homomorphism if it preserves brackets: A[X,Y] = [AX, AY]. An invertible 
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Lie algebra homomorphism is called a Lie algebra isomorphism. If there 
exists a Lie algebra isomorphism from a to 6, we say they are isomorphic 
as Lie algebras. 


Exercise 15.1. Verify that the kernel and image of a Lie algebra homo- 
morphism are Lie subalgebras. 


Exercise 15.2. If g and § are Lie algebras and A: g — § is a linear map, 
show that A is a Lie algebra homomorphism if and only if A[E;,E;] = 
[AE;, AE;] for some basis (£1,..., En) of g. 


Now suppose G is a Lie group. Recall that each element g € G defines a 
diffeomorphism L,: G — G called left translation, given by L,(h) = gh. A 
smooth vector field X € J(G) is said to be left-invariant if it is invariant 
under all left translations: L,,X = X for every g € G. (Because Ly, is a 
diffeomorphism, L,,X is a well-defined vector field on G, and LyX = X 
means Lg.(Xq’) = Xgq for every pair g,g’ of elements of G.) 


Lemma 15.2. Let G be a Lie group, and let g denote the set of all left- 
invariant vector fields on G. Then g is a Lie subalgebra of T(M). 


Proof. Because Ly,(aX + bY) = aLg.X + bLgsY, it is clear that g is a 
linear subspace of J(M). By the naturality of Lie brackets, if X,Y € g, 


Lgx[X, Y] = [Lge X, LgxY] = [X,Y]. 


Thus g is closed under Lie brackets. 


The Lie algebra g is called the Lie algebra of the Lie group G, and is de- 
noted by Lie(G). The fundamental fact is that Lie(G) is finite-dimensional, 
and in fact has the same dimension as G itself, as the following theorem 
shows. 


Theorem 15.3. Let G be a Lie group and let g = Lie(G). The evaluation 
map g > T.G, given by X ++ Xe, is a vector space isomorphism. Thus g 
is finite-dimensional, with dimension equal to dim G. 


Proof. We will prove the theorem by constructing an inverse to the evalu- 
ation map. For each V € 7T.G, define a section V of TG by 


Va = Lg. 


If there is a left-invariant vector field on G whose value at the identity is 
V, clearly it has to be given by this formula. 

First we need to check that V is in fact a smooth vector field. By Lemma 
3.14(c) it suffices to show that Vf is smooth whenever f is a smooth 
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function on an open set U C G. Choose a smooth curve y: (—é,¢) — G 
such that y(0) =e and y'(0) = V. Then for g € U, 


(VA)(g) = Vof 


The expression y(g,t) = f(gy(t)) depends smoothly on (g,t), because it 
is a composition of group multiplication, f, and y. Thus its t-derivative 
depends smoothly on g, and so V f is smooth. 

Next we need to verify that V is left-invariant, which is to_say that 


LnxVg = Vng for all g,h € G. This follows from the definition of V and the 
fact that Ly, 0 Lg = Lng: 


LiwVg = Lnw(LgxV) = LngeV = Vag: 
Thus V € g. 


Finally, we check that the map T: V V is an inverse for the evaluation 
map ¢: X  X,_. On the one hand, given a vector V € T.G, 


e(r(V)) = (V)e=LaV =V, 


which shows that ¢ 07 is the identity on T.G. On the other hand, given a 
vector field X € g, 


T(E(X))g = Xelg = Lg,Xe = Xq, 


which shows that 7 0 € = Idg. 


Example 15.4. Let us compute the Lie algebras of some familiar Lie 
groups. 


(a) Euclidean space R": Left translation by an element b € R” is given by 
the affine map L,(x) = «+ b, whose push-forward Lp, is represented 
by the identity matrix in standard coordinates. This implies that 
a vector field V’0/0z" is left-invariant if and only if its coefficients 
V’ are constants. Because any two constant-coefficient vector fields 
commute (by formula (13.4)), the Lie algebra of R” is abelian, and 
is isomorphic to R” itself with the trivial bracket operation. In brief, 
Lie(R”) & R". 


Lie Algebras 375 


(b) The circle group S': There is a unique unit tangent vector field T on S! 
that is positively oriented with respect to the orientation induced by 
the outward normal. (If @ is any local angle coordinate on an open set 
U cS!', then T = 0/06 on U.) Because left translations are rotations, 
which preserve T’,, it follows that T is left-invariant, and therefore T 
spans the Lie algebra of S'. This Lie algebra is 1-dimensional and 
abelian, and therefore Lie(S') = R. 


(c) Product groups: If G and H are Lie groups, it is easy to check that 
the Lie algebra of the product group G x H is the product Lie algebra 
Lie(G) x Lie(H). 


(d) The n-torus T”: Since T” is the n-fold product of S' with itself, its 
Lie algebra is isomorphic to R x --- x R = R”. In particular, it is 
abelian. If T; is the oriented unit vector field on the ith S! factor, 
then (T1,..., Zn) is a basis for Lie(T”). 


Theorem 15.3 has several useful corollaries. Recall that a smooth mani- 
fold is said to be parallelizable if its tangent bundle admits a global frame, 
or equivalently if its tangent bundle is trivial. 


Corollary 15.5. Every Lie group is parallelizable. 


Proof. Let G be a Lie group and let g be its Lie algebra. Choosing any basis 
X1,...,Xn for g, Theorem 15.3 shows that Xi|e,...,Xn|- form a basis for 
T.G. For each g € G, Lyg.: T-G — T,G is an isomorphism taking X;|. to 
Xi|g, 80 Xilg,---,Xn|g form a basis for T,G at each g € G. In other words 
(X;) is a global frame for G. 


The proof of the preceding corollary actually shows that any basis for 
the Lie algebra of G is a global frame consisting of left-invariant vector 
fields. We will call any such frame a left-invariant frame. 


Corollary 15.6. Every Lie group is orientable. 


Proof. Proposition 10.5 shows that every parallelizable manifold is ori- 
entable. 


A tensor or differential form o on a Lie group is said to be left-invariant 
if Ljo =o forallg eG. 


Corollary 15.7. Every compact oriented Lie group G has a unique left- 
invariant orientation form Q with the property that te Q=1. 


Proof. Let Ey,..., Ep be a left-invariant global frame on G. By replacing 
EF, with —E; if necessary, we may assume this frame is positively oriented. 
Let <',...,€” be the dual coframe. Left invariance of E; implies that 


(Lie*)(E;) = €*(LgeE;) = €*(E;), 
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which shows that L*e' = ¢', so e’ is left-invariant. 


Let Q=e!A---Ae”. Then 
* 2s * 1 * elk = 
[jQ = Loe A+ ALge” =€ Ave NEM =Q, 


so 2 is left-invariant as well. Because 0(Fi,...,Fn) = 1 > 0, Q is an 
orientation form for the given orientation. Clearly any positive constant 
multiple of Q is also a left-invariant orientation form. Conversely, if Q is 
any other left-invariant orientation form, we can write Q. = cQ, for some 
positive number c. Using left-invariance, we find that 


GHP a0 Oe Say, 


which proves that Qisa positive constant multiple of Q. 

Since G is compact and oriented, ie Q makes sense, so we can define 
Q = ( Sg Q)*Q. Clearly Q. is the unique left-invariant orientation form for 
which G has unit volume. 


Remark. The n-form whose existence is asserted in this proposition is called 
the Haar volume form on G, and is often denoted dV. Similarly, the map 
fr te f dV is called the Haar integral. Observe that the proof above did 
not use the fact that G was compact until the last paragraph; thus every 
Lie group has a left-invariant orientation form that is uniquely defined up 
to a positive constant. It is only in the compact case, however, that we can 
use the volume normalization to single out a unique one. 


The General Linear Group 


Before going on with the general theory, we will explore one more funda- 
mental example: the general linear group. In this chapter, let us denote the 
n x n identity matrix by I instead of I,, for brevity. Theorem 15.3 gives a 
vector space isomorphism between Lie(GL(n, R)) and T; GL(n, R) as usual. 
Because GL(n, R) is an open subset of the vector space gl(n, R), its tangent 
space is naturally isomorphic to gl(n, R) itself. The composition of these two 
isomorphisms gives a vector space isomorphism Lie(GL(n, R)) ~ gl(n, R). 

Both Lie(GL(n,R) and gl(n, R) have independently defined Lie algebra 
structures—the first coming from Lie brackets of vector fields and the sec- 
ond from commutator brackets of matrices. The next proposition shows 
that the natural vector space isomorphism between these spaces is in fact 
a Lie algebra isomorphism. Problem 15-14 shows that the analogous result 
holds for the complex general linear group. 


Proposition 15.8 (Lie Algebra of the General Linear Group). 
The composite map 


gl(n, R) — T; GL(n, R) — Lie(GL(n, R)) (15.1) 
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gives a Lie algebra isomorphism between Lie(GL(n,R)) and the matrix al- 
gebra gl(n,R). 


Proof. Using the matrix entries At as global coordinates on GL(n, R) C 
gl(n, R), the natural isomorphism gl(n,R) —— T; GL(n, R) takes the form 


(Bj) a aa 


(Because of the dual role of the indices 7, 7 as coordinate indices and matrix 
row and column indices, it is impossible to use our summation conventions 
consistently in this context, so we will write the summation signs explicitly.) 

Let g denote the Lie algebra of GL(n,R). For any matrix B = (Bi) € 


gl(n,R), the left-invariant vector field B € g corresponding to B is given 


) 


Since L 4 is the restriction of the linear map B+> AB on gl(n,R), its push- 
forward is represented in coordinates by exactly the same linear map. In 
other words, 


3) 
OA’ 


Ba = LaB= Las & Bi 


(15.2) 


Ba= DAB agr| 


ijk 


Given two matrices B,C’ € gl(n, R), the Lie bracket of the corresponding 
left-invariant vector fields is given by 


fon ee ip O 
B= [Day Pan Fae 


ijk 
O 
= A‘B (APC2 
Pa Phage 1Or) BAP 
a S- APCI—_(At ‘ Bl) 0 
nae q * aa JA}. 
O 
= i pI gk PO 
= AiBlct gy - 2 Air aap 
os ee ra) 
4 k a k 


where we have used the fact that 0(A?)/0Aj, is equal to one if p = i and 
q = k and zero otherwise, and Bi and Ci are constants. Evaluating this 
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last expression when A is equal to the identity matrix, we get 


Bee tos ; : 0 
— al 4 Dk 

[B, Clr — D(BiCr = C,. Br) OA’ ; : 

This is the vector corresponding to the matrix commutator bracket [B, C]. 

Since the left-invariant vector field [B,C] is determined by its value at the 


identity, this implies that 
[B,C] = [B,C], 


which is precisely the statement that the composite map (15.1) is a Lie 
algebra isomorphism. 


Induced Lie Algebra Homomorphisms 


In this section, we show that a Lie homomorphism between Lie groups in- 
duces a Lie algebra homomorphism between their Lie algebras, and explore 
some of the consequences of this fact. 


Theorem 15.9. Let G and H be Lie groups and g and h their Lie algebras, 
and suppose F': G > H is a Lie group homomorphism. For every X € g, 
there is a unique vector field in h that is F-related to X. The map F.: g > 
so defined is a Lie algebra homomorphism. 


Proof. If there is any vector field Y € § that is F-related to X, it must 
satisfy Y. = F,X-, and thus it must be uniquely defined by 


VaR 


To show that this Y is F-related to X, we note that the fact that F is a 
homomorphism implies 


F (9192) = F(gi) F (92) 
=> F(Lyg,92) = Lrg) F(92) 
== Fol,=Lprg)oF 
= F, © Digs = Le(g)x o F,. 


Thus 
FLX, = F,(LgxXe) 
= Li(g)s Fe Xe 
= Lr(g)xYe 
= Yr(g)- 
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This says precisely that X and Y are F-related. 

Now, for each X € g, let FX denote the unique vector field in h that is F- 
related to X. It then follows immediately from the naturality of Lie brackets 
that F.[X,Y] = |[F.X, FY], so F, is a Lie algebra homomorphism. 


The map Ff: g — 6 whose existence is asserted in this theorem will be 
called the induced Lie algebra homomorphism. 


Proposition 15.10 (Properties of the Induced Homomorphism). 


(a) The homomorphism Idg.: Lie(G) — Lie(G) induced by the identity 
map of G is the identity of Lie(G). 


(b) If Fi: G— H and Fh: H = K are Lie group homomorphisms, then 
(F5 (e) Fi)» = (Fo). ie) (Fi)«: Lie(G) e Lie(K). 


Proof. Both of these relations hold for push-forwards, and the value of the 
induced homomorphism on a left-invariant vector field X is defined by the 
push-forward of X¢. 


In the language of category theory, this proposition says that the as- 
signment G +> Lie(G), F + F, is a covariant functor from the category 
of Lie groups to the category of Lie algebras. The following corollary is 
immediate. 


Corollary 15.11. Isomorphic Lie groups have isomorphic Lie algebras. 
The next corollary has a bit more substance to it. 


Corollary 15.12 (The Lie Algebra of a Lie Subgroup). Suppose 
HCG is a Lie subgroup. The subset h C Lie(G) defined by 


6 ={X € Lie(G) : X. € T.-H} 
is a Lie subalgebra of Lie(G) canonically isomorphic to Lie(H). 


Proof. Let g = Lie(G) and b = Lie(G). It is clear that the inclusion map 
ty: H @ Gis a Lie group homomorphism, so v7.(h) is a Lie subalgebra 
of g. By the way we defined the induced Lie algebra homomorphism, this 
subalgebra is precisely the set of left-invariant vector fields on G whose 
value at the identity is of the form t4.V for some V € T.H. Since the 
push-forward map ty.: T.-H — T.G is the inclusion of TH as a subspace 
in T.G, it follows that cy.(h) = 6. To complete the proof, therefore, we 
need only show that ty.: h — g is injective, so that it is an isomorphism 
onto its image. If v747,X = 0, then in particular vq, X- = 0. Since uy is an 
immersion, this implies that X. = 0 and therefore by left-invariance X = 0. 
Thus ty. is injective as claimed. 
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Using this corollary, whenever H is a Lie subgroup of G, we will generally 
identify Lie(H) as a subalgebra of Lie(G). It is important to remember that 
elements of Lie(H) are only vector fields on H, and so, strictly speaking, are 
not elements of Lie(G). However, by the preceding lemma, every element of 
Lie(H) corresponds to a unique element of Lie(G), determined by its value 
at the identity, and the injection of Lie(H) into Lie(G) thus determined 
respects Lie brackets; so by thinking of Lie(H) as a subalgebra of Lie(G) 
we are not committing a grave error. 

This identification is especially illuminating in the case of Lie subgroups 
of GL(n, R). 


Example 15.13. Consider O(n) as a Lie subgroup of GL(n,R). By Ex- 
ample 5.26, it is equal to the level set F~'(I), where F: GL(n,R) > 
S(n,IR) is the submersion F(A) = A?A. By Lemma 5.29, Ty O(n) = 
Ker F,: Ty GL(n, R) — T;S(n,R). By the computation in Example 5.26, 
this push-forward is F,B = B™ + B, so 


T; O(n) = {B € gl(n,R) : B? + B=0} 


= {skew-symmetric n x n matrices}. 


We denote this subspace of gl(n,IR) by o(n). The preceding corollary then 
implies that a(n) is a Lie subalgebra of gl(n, R) isomorphic to Lie(O(n)). 
Notice that we did not even have to verify directly that o(n) is a subalgebra. 


As another application, we consider Lie covering groups. A Lie group 
homomorphism F': G — H that is also a smooth covering map is called a 
covering homomorphism, and we say G is a (Lie) covering group of H. By 
Problem 7-5, if G is connected and Ff: G — 4H is a surjective Lie group 
homomorphism with discrete kernel, then F' is a covering homomorphism. 
Every connected Lie group G has a universal covering space G, which is 
naturally a smooth manifold by Proposition 2.8 and a Lie group by Problem 
7-6. We call G the universal covering group of G. 

The next proposition shows how Lie algebras behave under covering ho- 
momorphisms. 


Proposition 15.14. Suppose G and H are Lie groups and F: G — H is a 
covering homomorphism. Then F.: Lie(G) — Lie(H) is an isomorphism. 


Proof. Because a smooth covering map is a local diffeomorphism, the push- 
forward from T.G to T.H is an isomorphism, and therefore the induced Lie 
algebra homomorphism is an isomorphism. 
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One-Parameter Subgroups 


In this section, we explore another set of relationships among Lie algebras, 
vector fields, and Lie groups. It will give us yet another characterization of 
the Lie algebra of a Lie group. 

Let G be a Lie group. We define a one-parameter subgroup of G to be a 
Lie group homomorphism F': R — G. Notice that, by this definition, a one- 
parameter subgroup is nota Lie subgroup of G, but rather a homomorphism 
into G. (However, as Problem 15-5 shows, the image of a one-parameter 
subgroup is a Lie subgroup.) 

We will see shortly that the one-parameter subgroups are precisely the 
integral curves of left-invariant vector fields starting at the identity. Before 
we do so, however, we need the following lemma. 


Lemma 15.15. Every left-invariant vector field on a Lie group is com- 
plete. 


Proof. Let g be the Lie algebra of the Lie group G, let X € g, and let 0 
denote the flow of X. Suppose some maximal integral curve 0 is defined 
on an interval (a,b) C R, and assume that b < oo. (The case a > —oo is 
handled similarly.) We will use left-invariance to define an integral curve 
on a slightly larger interval. 

By Lemma 13.8, left-invariance of X means that 


L of =h0L 15.3 
g g 


whenever the left-hand side is defined. Observe that the integral curve 0°) 
starting at the identity is defined at least on some interval (—e,¢) for ¢ > 0. 
Choose some s € (b—,b), and define a new curve y: (a,s+¢€) — G by 


@) = 0(9) (t) t € (a,b) 
HN Vee tes (iale)) Ge (eee = B): 


By (15.3), when s € (a,b) and |t — s| < ¢, we have 


so the two definitions of y agree where they overlap. 
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Now, 7¥ is clearly an integral curve of X on (a,b), and for to € (s—e, +e) 
we use left-invariance of X to compute 


d 
/ pos 


Lo, (9) (4-s(e)) 


t=to 


d ec 
= Lo.(g)x di 6 Nt - 8) 


t=to 
= Lo, (9) X oe) (to—s) 
= Xy(t0)- 


Thus 7 is an integral curve of X defined for t € (a,s +e). Since s+e>b, 
this contradicts the maximality of 0°). 


Proposition 15.16. Let G be a Lie group, and let X € Lie(G). The inte- 
gral curve of X starting at e is a one-parameter subgroup of G. 


Proof. Let 6 be the flow of X, so that 6°): R > G is the integral curve in 
question. Clearly 6°) is smooth, so we need only show that it is a group 
homomorphism, i.e., that 0°) (s + t) = 0 (s)0) (t) for all s,t € R. Using 
(15.3) once again, we compute 
0°) (8) (t) = Loveys) 4(e) 

= 41 (Lo ves) (€)) 

= (8 (s)) 

= 6:(4s(€)) 

= O45(€) 

= 9 (t +s). 


The main result of this section is that all one-parameter subgroups are 
obtained in this way. 


Theorem 15.17. Every one-parameter subgroup of a Lie group is an in- 
tegral curve of a left-invariant vector field. Thus there are one-to-one cor- 
respondences 


{one-parameter subgroups of G}— Lie(G) > T.G. 


In particular, a one-parameter subgroup is uniquely determined by its initial 
tangent vector in T.G. 


Proof. Let F: R — G be a one-parameter subgroup, and let X = 
F,(d/dt) € Lie(G), where we think of d/dt as a left-invariant vector field on 
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R. To prove the theorem, it suffices to show that F is an integral curve of 
X. Recall that F.(d/dt) is defined as the unique left-invariant vector field 
on G that is F-related to d/dt. Therefore, for any to € R, 


F'(to) = Fy 2 


=X 
dt é F(to)> 


so F is an integral curve of X. 


Given X € Lie(G), we will call the one-parameter subgroup determined 
in this way the one-parameter subgroup generated by X. 

The one-parameter subgroups of the general linear group are not hard 
to compute explicitly. 


Proposition 15.18. For any B € gl(n,R), let 
eP = 2 1 Bh (15.4) 
7 — ki 
=0 


This series converges to an invertible matrix e? € GL(n,R), and the one- 
parameter subgroup of GL(n,R) generated by B € gl(n,R) is F(t) =e”. 


Proof. First we verify convergence. From Exercise A.24 in the Appendix, 
matrix multiplication satisfies |AB| < | A] |B], where the norm is the Euclid- 
ean norm on gl(n,R) under its obvious identification with R”’. It follows 
by induction that |B*| < |B|*. The Weierstrass M-test shows that (15.4) 
converges uniformly on any bounded subset of gl(n,IR) (by comparison with 
the series >, (1/k!)C* = e®). 

Fix B € gl(n,R). The one-parameter subgroup generated by B is an 
integral curve of the left-invariant vector field B, and therefore satisfies the 
ODE initial value problem 


FG) = Bri), 
F(0) =I. 


Using formula (15.2) for B, the condition for F to be an integral curve can 
be rewritten as 


(Fy) ) = FOB, 


or in matrix notation 


We will show that F(t) = e’® satisfies this equation. Since F(0) = I, this 
implies that F' is the unique integral curve of B starting at the identity 
and is therefore the desired one-parameter subgroup. 
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To see that F is differentiable, we note that differentiating the series 
(15.4) formally term-by-term yields the result 


Since the differentiated series converges uniformly on compact sets (because 
it is the same series!), the term-by-term differentiation is justified. A similar 
argument shows that F’(t) = BF(t). By smoothness of solutions to ODEs, 
F is a smooth curve. 

It remains only to show that F(t) is invertible for all t, so that F' actually 
takes its values in GL(n,R). If we let o(t) = F(t)F(—t) = e’?e~*®, then 
a is a smooth curve in gl(n,R), and by the previous computation and the 
product rule it satisfies 


o'(t) = (F()B)F(-t) — F()(BF(-#)) = 0. 
Therefore o is the constant curve o(t) = o(0) = J, which is to say that 


F(t)F(-t) = I. Substituting —t for t, we obtain F(—t)F(t) = I, which 
shows that F(t) is invertible and F(t)~' = F(-t). 


Next we would like to compute the one-parameter subgroups of sub- 
groups of GL(n, R), such as O(n). To do so, we need the following result. 


Proposition 15.19. Suppose H C G is a Lie subgroup. The one- 
parameter subgroups of H are precisely those one-parameter subgroups of 
G whose initial tangent vectors lie in T.H. 


Proof. Let F: R — H be a one-parameter subgroup. Then the composite 
map 


Res G 


is a Lie group homomorphism and thus a one-parameter subgroup of G, 
which clearly satisfies F’(0) € T.H. 

Conversely, suppose F’: R — G is a one-parameter subgroup whose initial 
tangent vector lies in T.H. Let F’: R — H be the one-parameter subgroup 
of H with the same initial tangent vector F’(0) = F’(0) € T.-H C T.G. As 
in the preceding paragraph, by composing with the inclusion map, we can 
also consider F’ as a one-parameter subgroup of G. Since F and F are both 
one-parameter subgroups of G with the same initial tangent vector, they 
must be equal. 
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Example 15.20. If H is a Lie subgroup of GL(n, R), the preceding propo- 
sition shows that the one-parameter subgroups of H are precisely the maps 
of the form F(t) = e’® for B € 6, where h C gl(n,R) is the subalge- 
bra corresponding to Lie(H) as in Corollary 15.12. For example, taking 
H = O(n), this shows that the exponential of any skew-symmetric matrix 
is orthogonal. 


The Exponential Map 


In the preceding section, we saw that the matrix exponential maps gl(n, R) 
to GL(n,R) and takes each line through the origin to a one-parameter 
subgroup. This has a powerful generalization to arbitrary Lie groups. 
Given a Lie group G with Lie algebra g, define a map exp: g — G, called 
the exponential map of G, by letting exp X = F(1), where F is the one- 
parameter subgroup generated by X, or equivalently the integral curve of 
X starting at the identity. 
Example 15.21. The results of the preceding section show that the expo- 
nential map of GL(n, R) (or any Lie subgroup of it) is given by exp A = e4. 
This, obviously, is the reason for the term exponential map. 


Proposition 15.22 (Properties of the Exponential Map). Let G be 
a Lie group and g its Lie algebra. 


(a) The exponential map is smooth. 


(b) For any X € g, F(t) = exptX is the one-parameter subgroup of G 
generated by X. 


(c) For any X € g, exp(s + t)X = expsX exptx. 


(d) The push-forward exp,: Tog — T-G is the identity map, under the 
canonical identifications of both Tog and T.G with g itself. 


(e) The exponential map is a diffeomorphism from some neighborhood of 
0 in g to a neighborhood of e in G. 


(f) For any Lie group homomorphism F': G — H, the following diagram 
commutes: 


Fy 


g iy 
| |x 
G—-H 


F 


(g) The flow 6 of a left-invariant vector field X is given by 6; = Rexptx 
(right multiplication by exptxX). 
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Proof. For this proof, for any X € g we let 6(x) denote the flow of X. To 
prove (a), we need to show that the expression ga) depends smoothly 
on X, which amounts to showing that the flow varies smoothly as the vector 
field varies. This is a situation not covered by the fundamental theorem on 
flows, but we can reduce it to that theorem by the following trick. Define 


a smooth vector field = on the product manifold G x g by 
E(g,x) = (Xg,0) € TyG x Txg = Tig,x)(G X g). 
It is easy to verify that the flow O of = is given by 
Ox(9, X) = (Ax)e(9), X). 


By the fundamental theorem on flows, O is a smooth map. Since exp X = 
m(O1(e, X)), where 7: G x g > G is the projection, it follows that exp is 
smooth. 

Since the one-parameter subgroup generated by X is equal to the integral 
curve of X starting at e, to prove (b) it suffices to show that exptX = 


ae); or in other words that 
(e) (e) 
Dex) (1) 3 Ixy (t). (15.5) 
In fact, we will prove that for all s,t € R, 
Beye) =O eo): (15.6) 


which clearly implies (15.5). 
To prove (15.6), fix t € R and define a smooth curve 7: R — G by 


4(s) = 61%, (st). 


By the chain rule, 
(8) = #0(R)) (st) = tX00), 

so y is an integral curve of the vector field tX. Since (0) = e, by uniqueness 
of integral curves we must have y(s) = ae (s), which is (15.6). This proves 
(b). 

Next, (c) follows immediately from (b), which shows that t > exptxX is 
a group homomorphism. 

To prove (d), let X € g be arbitrary, and let o: R — g be the curve 
a(t) =tX. Then o’(0) = X, and (b) implies 

exp, X = exp, 0’(0) 
= (exp oa)'(0) 


Se exp tx 
t=0 
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Part (e) then follows immediately from (d) and the inverse function theo- 
rem. 

Next, to prove (f) we need to show that exp(Fi.X) = F(exp X) for any 
X €g. In fact, we will show that for all t € R, 


exp(tF, X) = F(exptx). 


The left-hand side is, by (b), the one-parameter subgroup generated by 
FX. Thus, if we put o(t) = F(exptxX), it suffices to show that o is a 
group homomorphism satisfying o/(0) = F,X. We compute 


d d 
‘(0) = — F tX) =F, — tX = FX 
o'(0) = Ff FlexptX) =F. 5] exp 
and 
o(s+t) = F(exp(s+t)X) 

= F(exp 5X exptx) (by (c)) 

= F(expsX)F(exptx) (since F' is a homomorphism) 
= 0(s)o(t). 


Finally, to show that 0(x): = Rexptx, we use part (b) and (15.3) to show 
that for any g € G, 


Rexptx (9) = g exptX 
= L,(exptx) 
= L,(x)+(e)) 
= x)(Lq(e)) 
= 9(x):(9)- 


In the remainder of this chapter, we present a variety of important ap- 
plications of the exponential map. The first is a powerful technique for 
computing the Lie algebra of a subgroup. 


Lemma 15.23. Let G be a Lie group, and let H C G be a Lie subgroup. 
Inclusion H — G induces an isomorphism between Lie(H) and the subal- 
gebra h C Lie(G) given by 


h = {X © Lie(G) : exptX € H for allt € R}. 


Proof. Let X be an arbitrary element of Lie(G). Suppose first that X € h. 
Letting ~ denote the curve y(t) = exptxX, the fact that +(t) lies in H for 
all t means that X. = 7/(0) € T.-H, which means that X is in the image 
of Lie(H) under inclusion (see Corollary 15.12). Conversely, if X is in the 
image of Lie(H), then X. € T.H, which implies by Proposition 15.19 that 
exptX € H for all t. 
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Corollary 15.24. Let GC GL(n,R) be a Lie subgroup, and define a subset 
gCgl(n,R) by 
g=—{Beégl(n,R):e'? €G for allt € R}. 
Then g is a Lie subalgebra of gl(n,R) canonically isomorphic to Lie(G). 


As an application, we will determine the Lie algebra of SL(n, R). First 
we need the following lemma. 


Lemma 15.25. The matriz exponential satisfies the identity 
det e4 = e'4” (15.7) 


Proof. Let A € gl(n,R) be arbitrary, and consider the smooth function 
T: RR given by 


7(t) = € * "4 dete. 


We compute the derivatives of the two factors separately. First, using the 
result of Problem 4-10 and the chain rule, 


dt 
= d(det),ra(e’4 A) 
= (det e*) tr ((e4)~1(e'“.A)) 
= (det e’4) tr A. 

Then, because the trace is linear, 


d 


ae) = ent S( tr tA) 


=e" **4(_ tr A). 


Therefore, by the product rule, 


d d 
/ —trtaA 7 ta tA —trtA 
T(t) =e 7 (det e’“) + dete aE (e ) 


= e #4 det ef4 tr A + det e'4e- * *4(_ tr A) 


Consequently 7(t) = 7(0) = 1 for all t. In particular, taking ¢t = 1, this 
implies (15.7). 


Example 15.26. Let sl(n,R) C gl(n,R) be the set of trace-free matrices: 


sl(n,R) C gl(n, R) = {B € gl(n,R) : tr B = 0}. 
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If B € sl(n,R), then the preceding lemma shows that det e'? = e'? = 1, 
so e’® € SL(n,R) for all t. Conversely, if e’? € SL(n,R) for all t, then 
1 = dete’? = et = et ®, which immediately implies that tr B = 0. 
Thus by Corollary 15.24, sl(n, R) is a Lie subalgebra of gl(n, R), isomorphic 
to Lie(SL(n, R)). 


The next lemma is a technical result that will be used below. 
Lemma 15.27. Let G be a Lie group and g its Lie algebra. 


(a) Ifm: Gx G => G denotes the multiplication map, then ms: TeG x 
T-G => T.G is given by m,.(X,Y)=X+Y. 


(b) If A,B C g are complementary linear subspaces of g, then the map 
Ax B= G given by (X,Y) exp X expY is a diffeomorphism from 
some neighborhood of (0,0) in A x B to a neighborhood of e in G. 


Exercise 15.3. Prove this lemma. 


The following proposition shows how the group structure of a Lie group 
is reflected “infinitesimally” in the algebraic structure of its Lie algebra. 
The second formula, in particular, shows how the Lie bracket expresses the 
leading term in the Taylor series expansion of a group commutator. 

In the statement of the proposition, we use the following standard nota- 
tion from analysis: The expression O(t*) means any g-valued function of t 
that is bounded by a constant multiple of |t|\* as t 0. (The bound can be 
expressed in terms of any norm on g; since all norms on a finite-dimensional 
vector space are equivalent, the definition is independent of which norm is 
chosen.) 


Proposition 15.28. Let G be a Lie group and g its Lie algebra. For any 
X,Y €g, the exponential map satisfies 

(exptX)(exptY) = exp(t(X +Y)+ st? [X,Y] + o(¢?)), 

(exp tX )(exp tY)(exp —tX) (exp —tY) = exp(t?[X, Y] + O(t?)), (15.9) 


whenever t € R is sufficiently close to 0. 


Proof. Let X € g, and let @ denote the flow of X. If f is any smooth 
function defined on an open subset of G, 


d d 
al (gexptX) = = f(8(9)) = Xa f = Xf(gexptX), (15.10) 
because t +> gexptX = @;(g) is an integral curve of X. Applying this same 
formula to the function X f yields 


2 


flgexptX) = x f(gexptX) — X* f(gexptX). 
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In particular, if f is defined in a neighborhood of the identity, evaluating 
these equations at g = e and t = 0 yields 


=Xf(e), (15.11) 


d 
qt (exP tX) ns 


= X"f(e). (15.12) 
t=0 


a? 
qua f (exp tX) 

By Taylor’s theorem in one variable, therefore, we can write 
f(exptX) = f(e) + tX f(e) + $t?X? f(e) + O(¢*). (15.13) 


Now, let X,Y € g be fixed, and define a function u: R? — R by u(s,t) = 
f(exp sX exptY). We will prove the proposition by analyzing the Taylor 
series of u(t,t) about t = 0. First, by the chain rule and (15.11), 

Ou Ou 


—ult,t 0,0 — (0,0 XxX Y : 15.14 
HMO] = Fel0.0) + 0.0) = XH +¥ Ne). (15.14) 
By (15.12), the pure second derivatives of u are given by 


“*(0,0) =X*fe), 5% (0,0) = ¥*s(0). 


Ot 
To compute the mixed second derivative, we apply (15.10) twice to obtain 
Oru ) ) 
0,0) = — — xX Y 
r= 5 (F]_ flewpsXexpey)) 
- 2 Y f(exp sX) (15:15) 
08} = 
= XY f(e). 
Therefore, by the chain rule, 
2 O7u 07u 07u 
t,t 2 0,0) 4 
qe Os Bae (O10) + 2554 010) + ep (0.0) 


= X?f(e) + 2XYf(e)+Y7f(e). 
Taylor’s theorem then yields 
f(exptX exptY) = u(t,t) 
= u(0,0)+¢ ad) + 30? an t) 
dt px dt? = 
+ O(t?) 
= fle) + UX Fle) + YF(e)) 
+ $t°(X? f(e) + 2XY f(e) + Y?f(e)) + O(¢*). 
(15.16) 
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Because the exponential map is a diffeomorphism on some neighbor- 
hood U of the origin in g, there is a smooth curve in U defined by 
7(t) = exp” '(exptX exptY) for t sufficiently near zero. It obviously satis- 
fies y(0) = 0 and 


exptX exptY = exp)(t). 


The implied constant in the O(t?) error term in (15.13) can be taken to be 
independent of X as long as X stays in a compact subset of g, as can be 
seen by expressing the error explicitly in terms of the remainder term in 
the Taylor series of the smooth function f o exp (in terms of any basis for 
g). Therefore, writing the Taylor series of y(t) as 


y(t) =tA+?B+ O(t) 


for some fixed A, B € g, we can substitute y(t) for ¢X in (15.13) and expand 
to obtain 


f(exptX exp tY) = f(exp7(t)) 
= f(e)+tAf(e) + Bf(e) + $A’ f(e) + O(8°). 
(15.17) 
Comparing like powers of t in (15.17) and (15.16), we see that 


Af(e) = Xf(e) + Yf(e), 
Bf(e) + 54° fle) = 2(X7 fle) + 2XYf(e) + Y*f(e)). 
Since this is true for every f, the first equation implies A = X+Y. Inserting 
this into the second equation, we obtain 
Bf(e) = 3(X*f(e) + 2XY Fle) + Y? fle) — a(X + Y)* Fle) 
= 5X°*f(e) + XY f(e) + 5Y7F(e) 
gk Je aNT He) oY lea ai fle) 
= 5(XY fle) -YXf(e)) 
= 3/X, Y]f(e). 


This implies B = $[X,Y], which completes the proof of (15.8). 
Finally, (15.9) is proved by applying (15.8) twice: 


(exp tX )(exp tY) (exp —tX) (exp —tY) 
= exp(t(X + Y) + $t?[X, Y] + O(t°)) x 
exp(t(—X — Y) + 42?[X, Y] + O(#°)) 
= exp(t?[X, Y] + O(t?)). 
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The formulas above are special cases of a much more general formula, 
called the Baker-Campbell-Hausdorff formula, which gives recursive expres- 
sions for all the terms of the Taylor series of y(t) in terms of X, Y, [X,Y], 
and iterated brackets such as [X,[X,Y]] and [Y, |X,[X,Y]]]. The full for- 
mula can be found in [Var84]. 


The Closed Subgroup Theorem 


The next theorem is one of the most powerful applications of the expo- 
nential map. For example, it allows us to strengthen Theorem 7.15 about 
quotients of Lie groups, because we need only assume that the subgroup H 
is topologically closed in G, not that it is a closed Lie subgroup. A similar 
remark applies to Proposition 7.21. 


Theorem 15.29 (Closed Subgroup Theorem). Suppose G is a Lie 
group and H CG is a subgroup that is also a closed subset. Then H is an 
embedded Lie subgroup. 


Proof. By Proposition 5.41, it suffices to show that H is an embedded 
submanifold of G. We begin by identifying a subspace of the Lie algebra of 
G that will turn out to be the Lie algebra of H. 

Let g denote the Lie algebra of G’, and define a subset h C g by 


h ={X €g:exptxX € H for allt € R}. 
We need to show that § is a vector subspace of g. It is obvious from the 
definition that if X € h, then tX € § for all t € R. To see that is closed 


under vector addition, let X,Y € § be arbitrary. Observe that (15.8) implies 
that for any t€ Rand neEN, 


t t t im 
exp —X exp —Y =exp| —(A+Y)+0(— ]], 
n n n n 
and a simple induction using (15.8) again shows that 
t tot 
(ew —X exp “y) = (exp (Ax +O (< =))). 
n n 
= exp (« (xX +Y) col? ) 


Taking the limit as n — oo, we obtain 


t tr 
expt(X +Y) = Jim (exp ma exp “y) ; 
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which is in H because H is closed in G. Thus X + Y € 6, and so hisa 
subspace. (In fact, (15.9) can be used in a similar way to prove that is a 
Lie subalgebra, but we will not need this.) 

Next we will show that there is a neighborhood U of the origin in g on 
which the exponential map of G is a diffeomorphism, and which has the 
property that 


exp(U 1h) = (expU)N H. (15.18) 


This will enable us to construct a slice chart for H near the identity, and 
we will then use left translation to get a slice chart in a neighborhood of 
any point of H. 

If U is any neighborhood of 0 € g on which exp is a diffeomorphism, then 
exp(U M6) Cc (expU) NA by definition of h. So to find a neighborhood 
satisfying (15.18), all we need to do is to show that U can be chosen small 
enough that (expU)NH Cc exp(UNbh). Assume this is not possible. Let {U;} 
be any countable neighborhood basis at 0 € g (for example, a countable 
sequence of coordinate balls whose radii approach zero). The assumption 
implies that for each 7, there exists h; € (expU;)M H such that h; ¢ 
exp(U;M). 

Choose a basis E),..., Ex, for § and extend it to a basis Fy,..., Em» for 
g. Let 6 be the subspace spanned by Fx41,..., £m, so that g = § x b as 
vector spaces. By Lemma 15.27, as soon as 7 is large enough, the map from 
h x 6 to G given by (X,Y) + exp X expY is a diffeomorphism from U; to 
a neighborhood of e in G. Therefore we can write 


h; = exp X; exp Y; 


for some X; € U;Nb and Y; € U;M 6, with Y; 4 0 because h; ¢ exp(U;N 
h). Since {U;} is a neighborhood basis, Y; — 0 as i — oo. Observe that 
exp X; € H by definition of 6, so it follows that exp Y; = (exp X;)~'h; € H 
as well. 

The basis {£;} induces a vector space isomorphism EF: g = R™. Let 
|-| denote the Euclidean norm induced by this isomorphism and define 
ci = |Y;|, so that c; + 0 as i > oo. The sequence {c; 'Y;} lies in the unit 
sphere in b with respect to this norm, so replacing it by a subsequence we 
may assume that c; 'Y; ~ Y € 6, with |Y| = 1 by continuity. In particular, 
Y 4 0. We will show that exptY € AH for all t € R, which implies that 
Y € §. Since 1b = {0}, this is a contradiction. 

Let t € R be arbitrary, and for each 7, let n; be the greatest integer less 
than or equal to t/c;. Then 


which implies 


arten — t| <G- 0, 
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so n;c; — t. Thus 
niYi = (nici)(c; Yi) — tY, 


which implies expn;Y; — exptY by continuity. But exp n;Y; = (exp Y;)” € 
H, so the fact that H is closed implies exptY € H. This completes the 
proof of the existence of U satisfying (15.18). 

The composite map y = Eoexp7!: expU — R™ is easily seen to be a 
coordinate chart for G, and by our choice of basis, y((exp U) NH) = E(UN 
h) is the slice obtained by setting the last m — k coordinates equal to zero. 
Moreover, if h € H is arbitrary, left multiplication L;, is a diffeomorphism 
from expU to a neighborhood of h. Since H is a subgroup, L,(H) = H, 
and so 


L,((exp U) NH) = L;,(expU) nN A, 


and yo Dee is easily seen to be a slice chart for H in a neighborhood of h. 
Thus H is a regular submanifold of G, hence a Lie subgroup. 


The following corollary summarizes the closed subgroup theorem and 
Proposition 5.41. 


Corollary 15.30. If G is a Lie group and H is any subgroup of G, the 
following are equivalent: 


(a) H is closed in G. 


(b) H is an embedded Lie subgroup. 


Lie Subalgebras and Lie Subgroups 


Earlier in this chapter, we saw that a Lie subgroup of a Lie group gives rise 
to a Lie subalgebra of its Lie algebra. In this section, we show that the con- 
verse is true: Every Lie subalgebra corresponds to some Lie subgroup. This 
result has important consequences that we will explore in the remainder of 
the chapter. 


Theorem 15.31. Suppose G is a Lie group with Lie algebra g. If h is any 
Lie subalgebra of g, then there is a unique connected Lie subgroup of G 
whose Lie algebra is h (under the canonical identification of the Lie algebra 
of a subgroup with a Lie subalgebra of g). 


Proof. Define a distribution D C TG by 


Dg ={Xg €T,G: X €h}. 
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If (X1,...,X,%) is any basis for b, then clearly D, is spanned by 
Xi\g,---,Xklg at any g € G. Thus D is locally (in fact, globally) spanned 
by smooth vector fields, so it is a smooth subbundle of T’G. Moreover, be- 
cause [X;,X,] € § for each 2,7, D is involutive. Let H{ denote the foliation 
determined by D, and for any g € G, let J, denote the leaf of H{ containing 


g: 
If g,g’ are arbitrary elements of G, then 


Lgx(Dg') = span(Lg.X1|9’, tee » Lg Xi g/) 
=> span(.X1|gq’, a » Xz gq’) 
— Dgg': 


so D is invariant under all left translations. If M is any connected integral 
manifold of D, then so is L,(M), since 


Ty Dg(M) = Lga(Ty-19g'M) = Lgx(Dg-19') = Dyy- 


If M is maximal, it is easy to see that L,(M) is as well. It follows that left 
multiplication takes leaves to leaves: Lg(Ilg’) = Hyg for any g,g’ € G. 
Define H = He, the leaf containing the identity. We will show that H is 
the desired Lie subgroup. 
First, to see that H is a subgroup, observe that for any h,h’ € H, 


hh! = Ly(h’) € L,(H) = Lp, (He) = KH, = H. 
Similarly, 
ho! = h-te € Ly-1(He) = Ep-1(Ha) = Hy-1, = H. 


To show that H is a Lie group, we need to show that the map p: (h, h’) 
hh’ is smooth as a map from Hx H to H. Because H x H is a submanifold 
of G x G, it is immediate that wu: H x H — G is smooth. Since H is an 
integral manifold of an involutive distribution, Proposition 14.7 shows that 
i is also smooth as a map into H. 

The fact that H is a leaf of F( implies that the Lie algebra of H is 5, 
because the tangent space to H at the identity is D. = {X.: X € bh}. To 
see that H is the unique connected subgroup with Lie algebra h, suppose H 
is any other connected subgroup with the same Lie algebra. Any such Lie 
subgroup is easily seen to be an integral manifold of D, so by maximality 
of H, we must have H C H. On the other hand, if U is the domain of a flat 
chart for D near the identity, then by Proposition 14.6. HU is a union 
of open subsets of slices. Since the slice containing e is an open subset of 
Hf, this implies that H contains a neighborhood V of the identity in H. 
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Moreover, for any other point h € H, L(V) is an open subset of H that is 
also contained in H, so H is open in H. By Problem 7-9, this implies that 
H=H. 


The most important application of Theorem 15.31 is in the proof of the 
next theorem. 


Theorem 15.32. Suppose G and H are Lie groups with G simply con- 
nected, and let g and h denote their Lie algebras. For any Lie algebra homo- 
morphism py: g — h, there is a unique Lie group homomorphism ®: G — H 
such that ®, = y. 


Proof. The Lie algebra of G x H is the product Lie algebra g x h. Let 
€ Cc g x § be the graph of ¢: 


€= {(X, pX): X € g}. 


Then € is a vector subspace of g x h because vy is linear, and in fact it isa 
Lie subalgebra because y is a homomorphism: 


[(X, eX), (X, eX’)] = ([X, X'], [pX, eX") = ([X, XJ, y[X, X']) et. 


Therefore, by the preceding theorem, there is a unique connected Lie sub- 
group K C G x H whose Lie algebra is €. 
The restrictions to K of the projections 


milk: K > G, Talk: K > H 


are Lie group homomorphisms because 7 and 72 are. Let II = m|K: K > 
G. We will show that II is a smooth covering map. Since G is simply 
connected, this will imply that II is a diffeomorphism and thus a Lie group 
isomorphism. 

To show that II is a smooth covering map, it suffices by Problem 7-5 to 
show that II is surjective and has discrete kernel. Consider the sequence of 
maps 


KoGxH-4G 


whose composition is I]. The induced Lie algebra homomorphism II,, is just 
inclusion followed by projection on the algebra level: 


eogxh—g. 
This last composition is nothing more than the restriction to € of the pro- 


jection g x h — g. Because Mh = {0} (since € is a graph), it follows that 
II: € — g is an isomorphism. Because II is a Lie group homomorphism, it 
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has constant rank, and therefore it is a local diffeomorphism and an open 
map. Moreover, its kernel is an embedded Lie subgroup of dimension zero, 
which is to say a discrete group. Because II(K’) is an open subgroup of the 
connected Lie group G, it is all of G by Problem 7-9. Thus II is a surjective 
Lie group homomorphism with discrete kernel, so it is a smooth covering 
map and thus a Lie group isomorphism. 

Define a Lie group homomorphism ®: G — H by ® = 79| oI1~!. Note 
that the definition of ® implies that 


Talk = Pom|xK. 


Because the Lie algebra homomorphism induced by the projection 7: G x 
H — BH is just the linear projection 7,: g x § — h, this implies 


To|e = OB, oT |e: & Ob. 
Thus if X € g is arbitrary, 
pX = m2|e(X, pX) 
= ®, 0 m4 |e(X, pX) 
= OX, 
which shows that ®, = y. 


The proof is completed by showing that ® is the unique homomorphism 
with this property. This is left as an exercise. 


Exercise 15.4. Show that the homomorphism ® constructed in the pre- 
ceding proof is the unique Lie group homomorphism such that ®, = y. 
[Hint: Consider the graph.] 


Corollary 15.33. If G and H are simply connected Lie groups with iso- 
morphic Lie algebras, then G and H are Lie isomorphic. 


Proof. Let g, 6 be the Lie algebras, and let y: g — bh be a Lie algebra 
isomorphism between them. By the preceding theorem, there are Lie group 
homomorphisms ®: G — H and WV: H — G satisfying ®, = y and W, = 
y'. Both the identity map of G and the composition V o ® are maps 
from G to itself whose induced homomorphisms are equal to the identity, 
so the uniqueness part of Theorem 15.32 implies that Vo ® = Id. Similarly, 
® o W = Id, so ® is a Lie group isomorphism. 


A version of this theorem was proved in the nineteenth century by Sophus 
Lie. However, since global topological notions such as simple connectedness 
had not yet been formulated, what he was able to prove was essentially a 
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local version of this corollary. Two Lie groups G and H are said to be 
locally isomorphic if there exist neighborhoods of the identity U C G and 
V C H, and a diffeomorphism F': U > V such that F(gig2) = F'(g1)F (g2) 
whenever gi, gz, and gig2 are all in U. 


Theorem 15.34 (Fundamental Theorem of Sophus Lie). Two Lie 
groups are locally isomorphic if and only if they have isomorphic Lie alge- 
bras. 


The proof in one direction is essentially to follow the arguments in Theo- 
rem 15.32 and Corollary 15.33, except that one just uses the inverse function 
theorem to show that ® is a local isomorphism instead of appealing to the 
theory of covering spaces. The details are left as an exercise. 


Exercise 15.5. Carry out the details of the proof of Lie’s fundamental 
theorem. 


The Fundamental Correspondence Between Lie 
Algebras and Lie Groups 


Many of the results of this chapter show how essential properties of a Lie 
group are reflected in its Lie algebra. This raises a natural question: To 
what extent is the correspondence between Lie groups and Lie algebras (or 
at least between their isomorphism classes) one-to-one? We have already 
seen in Corollary 15.11 that isomorphic Lie groups have isomorphic Lie 
algebras. The converse is easily seen to be false: Both R” and T” have n- 
dimensional abelian Lie algebras, which are obviously isomorphic, but R” 
and T” are certainly not isomorphic Lie groups. However, if we restrict our 
attention to simply connected Lie groups, then we do obtain a one-to-one 
correspondence. The central result is the following theorem. 


Theorem 15.35 (Lie Group—Lie Algebra Correspondence). 

There is a one-to-one correspondence between isomorphism classes of 
finite-dimensional Lie algebras and isomorphism classes of simply con- 
nected Lie groups, given by associating each simply connected Lie group 
with its Lie algebra. 


The proof of this theorem will tie together all the work we have done so 
far on Lie groups and their Lie algebras, together with one deep algebraic 
result that we will state without proof. Let us begin by describing the 
algebraic result. 

Let g be a Lie algebra. A representation of g is a Lie algebra homomor- 
phism p: g > gl(n,R) for some n. If p is injective, it is said to be a faithful 
representation. In this case, it is easy to see that g is isomorphic to p(g), 
which is a Lie subalgebra of gl(n, R). 
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Theorem 15.36 (Ado’s Theorem). Every Lie algebra has a faithful 
representation. 


The proof is long, hard, and very algebraic. It can be found in [Var84]. 


Proof of Theorem 15.35. We need to show that the association that sends 
a simply connected Lie group to its Lie algebra is both surjective and in- 
jective up to isomorphism. Injectivity is precisely the content of Corollary 
15.33. To prove surjectivity, suppose g is any finite-dimensional Lie alge- 
bra. Replacing g with its isomorphic image under a faithful representation 
p: g — gl(n,R), we may assume that g is a Lie subalgebra of gl(n,R). By 
Theorem 15.31, there is a unique Lie subgroup G C GL(n, R) that has g as 
its Lie algebra. Letting G be the universal covering group of G, Proposition 


15.14 shows that Lie(G) = Lie(G) & g. 


What happens when we allow non-simply-connected groups? Because 
every Lie group has a simply connected covering group, the answer in the 
connected case is easy to describe. (For disconnected groups, the answer is 
described in Problem 15-4.) 


Theorem 15.37. Let g be a finite-dimensional Lie algebra. The connected 
Lie groups whose Lie algebras are isomorphic to g are (up to isomorphism) 
precisely those of the form G/T, where G is the simply connected Lie group 
with Lie algebra g, and T ts a discrete normal subgroup of G. 


Proof. Given g, let G be a simply connected Lie group with Lie algebra 
isomorphic to g. Suppose H is any other Lie group whose Lie algebra is 
isomorphic to g, and let yw: Lie(G) — Lie(H) be a Lie algebra isomor- 
phism. Theorem 15.32 guarantees that there is a Lie group homomorphism 
®: G— GF such that ®, = y. Because ¢ is an isomorphism, ® is a local dif- 
feomorphism, and therefore [ = Ker © is a discrete normal subgroup of G. 
Since ® is a surjective Lie homomorphism with kernel T, H is isomorphic 
to G/T by Problem 7-4. 


Exercise 15.6. Show that two connected Lie groups are locally isomor- 
phic if and only if they have isomorphic universal covering groups. 
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Problems 


15-1. 


15-2. 
15-3. 


15-4. 


15-5. 


15-6. 


15-7. 


Let G be a connected Lie group, and U C G any neighborhood of the 
identity. Show that U generates G, i.e., that every element of G can 
be written as a finite product of elements of U. 


Compute the exponential maps of the abelian Lie groups R” and T”. 


Consider S? as the unit sphere in R* with coordinates (w, 2, y, 2). 
Show that there is a Lie group structure on S? in which the vector 
fields 

O ai O : ere, 
x T ’ 

Ow Ox Oy Vaz 

O : 0 : O a O 
X3 =—-Zz 


x + W—. 
Oz 


Ow Ox Oy 


X,= 


X2= 


form a left-invariant frame. [See Problems 7-8 and 3-6. It was shown 
in 1958 by Raoul Bott and John Milnor [BM58] using more advanced 
methods that the only spheres that are parallelizable are S', S*, and 
S’. Thus these are the only spheres that can possibly admit Lie group 
structures. The first two do, as we have seen; it turns out that S’ has 
no Lie group structure.] 


Let g be a finite-dimensional Lie algebra, and let G be the simply 
connected Lie group whose Lie algebra is g. Describe all Lie groups 
whose Lie algebra is g in terms of G and discrete groups. 


Let G be a Lie group. 


(a) Show that the images of one-parameter subgroups in G are pre- 
cisely the connected Lie subgroups of dimension less than or 
equal to 1. 

(b) If H Cc G is the image of a one-parameter subgroup, show that 
H is Lie isomorphic to one of the following: the trivial group 
{e}, R, or S?. 


Prove that there is exactly one nonabelian 2-dimensional Lie algebra 
up to isomorphism. 


Let A and B be the following elements of gl(2, R): 


Ae (eco) Pele 0) 


Compute the one-parameter subgroups of GL(2,R) generated by A 
and B. 
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15-8. Let GL*(n,R) be the subgroup of GL(n,R) consisting of matrices 
with positive determinant. (By Proposition 7.26, it is the identity 
component of GL(n,R).) 


(a) Suppose A € GL*(n,R) is of the form e? for some B € gl(n, R). 
Show that A has a square root, i.e., a matrix C € GL*(n,R) 
such that C? = A. 


(b) Let 


Show that the exponential map exp: gl(2,R) — GL*(2,R) is 
not surjective, by showing that A is not in its image. 


15-9. Let {i,j,k} denote the standard basis of R°, and let H = R x R, 
with basis {1,i,j,k}. Define a bilinear multiplication H x H — H by 
setting 


lqg=q1=4q for all g € H, 


ij=-ji=k, 
jk = —kj =i, 
ki = —ik =j, 


PeapPak=-1, 
and extending bilinearly. With this multiplication, H is called the ring 
of quaternions. 


(a) Show that quaternionic multiplication is associative. 


(b) Show that the set $ of unit quaternions (with respect to the 
Euclidean metric) is a Lie group under quaternionic multiplica- 
tion, and is Lie isomorphic to SU(2). 

(c) For any point q € H, show that the quaternions ig, jg, and kq 
are orthogonal to qg. Use this to define a left-invariant frame on 
8, and show that it corresponds under the isomorphism of (b) 
to the one defined in Problem 15-3. 


15-10. Look up the Cayley numbers, and prove that S’ is parallelizable by 
mimicking as much as you can of Problem 15-9. Why do the unit 
Cayley numbers not form a Lie group? 


15-11. Let A C T(R®) be the subspace with basis {X, Y, Z}, where 


3) 3) Y ) ) 0 3) 
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Show that A is a Lie subalgebra of T(IR?), which is Lie algebra iso- 
morphic to R® with the cross product, and also to the Lie algebra 
0(3) of O(3). 


15-12. Let G be a connected Lie group and g its Lie algebra. 
(a) If X, Y € g, show that [X,Y] = 0 if and only if 
exptX expsY =expsY exptX for all s,t ER. 


(b) Show that G is abelian if and only if g is abelian. 


(c) Give a counterexample when G is not connected. 


15-138. Show that every connected abelian Lie group is Lie isomorphic to 
R* x T! for some nonnegative integers k and I. 


15-14. Define a map 8: GL(n,C) — GL(2n,R) by identifying (2? + 
iyt,..., 2% + iy”) € C” with (c1,y',...,2",y”) € R™. 


(a) Show that ( is an injective Lie group homomorphism, so that 
we can identify GL(n,C) with the Lie subgroup of GL(2n, R) 
consisting of matrices built up out of 2 x 2 blocks of the form 
(fa): 

(b) Under our usual (vector space) isomorphisms T; GL(n,C) = 
gl(n,C) and T;GL(n,R) = gl(n,R), show that the induced 
Lie algebra homomorphism /3,.: Lie(GL(n, C)) — Lie(GL(n, R)) 
induces an injective Lie algebra homomorphism gl(n,C) — 
gl(n, R) (considering both as Lie algebras with the commutator 
bracket). Conclude that Lie(GL(n, C)) is Lie algebra isomorphic 
to the matrix algebra gl(n, C). 


(c) Determine the Lie algebras sl(n, C), u(m), and su(n) of SL(n, C), 
U(n), and SU(n), respectively, as matrix subalgebras of gl(n, C). 


15-15. Show by giving an explicit isomorphism that su(2) and o(3) are iso- 
morphic Lie algebras. 


Appendix 


Review of Prerequisites 


The essential prerequisites for reading this book are a thorough acquain- 
tance with abstract linear algebra, advanced multivariable calculus, and 
basic topology. The topological prerequisites include basic properties of 
topological spaces, topological manifolds, the fundamental group, and cov- 
ering spaces; these are covered fully in my book Introduction to Topological 
Manifolds [Lee00]. In this appendix, we summarize the most important 
facts from linear algebra and advanced calculus that are used throughout 
this book. 


Linear Algebra 


For the basic properties of vector spaces and linear maps, you can consult 
almost any linear algebra book that treats vector spaces abstractly, such as 
[FIS97]. Here we just summarize the main points, with emphasis on those 
aspects that will prove most important in the study of smooth manifolds. 


Vector Spaces 


Let R denote the field of real numbers. A vector space over R (or real vector 
space) is a set V endowed with two operations: vector addition V x V — V, 
denoted by (X,Y) X+Y, and scalar multiplication Rx V — V, denoted 
by (a, X) + aX; the operations are required to satisfy 


(i) V is an abelian group under vector addition. 
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(ii) Scalar multiplication satisfies the following identities: 


a(bX) = (ab)X for all X € V and a,b ER; 
1X =X for all X EV. 


(iii) Scalar multiplication and vector addition are related by the following 
distributive laws: 


(a+b)X =aX +bX for all X € V and a,be R; 
a(X +Y)=aX +aY for all X,Y € V andaeR. 


The elements of V are usually called vectors. When necessary to distinguish 
them from vectors, real numbers are sometimes called scalars. 

This definition can be generalized in two directions. First, replacing R by 
an arbitrary field F everywhere, we obtain the definition of a vector space 
over F. Second, if R is replaced by a ring ®, this becomes the definition of 
a module over R. We will be concerned almost exclusively with real vector 
spaces, but it is useful to be aware of these more general definitions. Unless 
we specify otherwise, all vector spaces will be assumed to be real. 

If V is a vector space, a subset W C V that is closed under vector 
addition and scalar multiplication is itself a vector space, and is called a 
subspace of V. 

Let V be a vector space. A finite sum of the form 4 a’ X;, where 
a’ € Rand X; € V, is called a linear combination of the vectors X1,..., Xx. 
(The reason we write the coefficients a‘ with superscripts instead of sub- 
scripts is to be consistent with the Einstein summation convention, which 
is explained in Chapter 1.) If S is an arbitrary subset of V, the set of all 
linear combinations of elements of S' is called the span of S and is denoted 
by span(S); it is easily seen to be the smallest subspace of V containing S. 
If V = span(S), we say S' spans V. By convention, a linear combination of 
no elements is considered to sum to zero, and the span of the empty set is 


{0}. 


Bases and Dimension 


A subset S' is said to be linearly dependent if there exists a linear relation 
of the form sy a’ X; = 0, where X,...,X, are distinct elements of S$ 
and at least one of the coefficients a’ is nonzero; it is said to be linearly 
independent otherwise. In other words, S is linearly independent if and only 
if the only linear combination of distinct elements of S that sums to zero 
is the one in which all the scalar coefficients are zero. Note that any set 
containing the zero vector is linearly dependent. By convention, the empty 
set is considered to be linearly independent. 


Exercise A.1. Let V be a vector space and SCV. 
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(a) If S is linearly independent, show that any subset of S is linearly 
independent. 

(b) If S is linearly dependent or spans V, show that any subset of V that 
properly contains S' is linearly dependent. 

(c) Show that S is linearly dependent if and only if some element X € S 
can be expressed as a linear combination of elements of S \ {X}. 

(d) If (Xi,...,Xm) is a finite, ordered, linearly dependent subset of V, 
show that some X; can be written as a linear combination of the pre- 
ceding vectors (X1,..., Xi-1). 


A basis for V is a subset S C V that is linearly independent and spans 
V. If S is a basis for V, every element of V has a unique expression as a 
linear combination of elements of S. If V has a finite basis, then V is said to 
be finite-dimensional, and otherwise it is infinite-dimensional. The trivial 
vector space {0} (which we denote by R°) is finite-dimensional, because it 
has the empty set as a basis. 


Lemma A.1. Let V be a finite-dimensional vector space. If V is spanned 
by n vectors, then every subset of V containing more than n vectors is 
linearly dependent. 


Proof. Suppose the vectors {X1,...,X,} span V. To prove the lemma, it 
clearly suffices to show that any n+1 vectors {Yi,..., Yn41} are dependent. 
Suppose not. By Exercise A.1(b), the set {Y1,X1,..., Xn} is dependent. 
This means there is an equation of the form b'Y;+)>, a‘X; = 0 in which not 
all of the coefficients are equal to zero. If 6! is the only nonzero coefficient, 
then Y, = 0 and clearly the set of Y;s is dependent. Otherwise, some a’ is 
nonzero; renumbering the X;s if necessary, we may assume it is at. Since we 
can solve for X, in terms of Y; and the other X;s, the set {Y1, X2,..., Xn} 
still spans V. 

Now suppose by induction that {Y1, Yo,..., Yr-1, Xz,---, Xn} spans V. 
As before, the set {Y1, Yo,..., ¥k-1, Ye, Xz,---, Xn} is dependent, so there 
is a relation of the form 


k n 
So ey, t+ So a'X; =0 
i=l ix=k 


with not all coefficients equal to zero. If all the a’s are zero, the Yjs are 
clearly dependent, so we may assume at least one of the a’s is nonzero, 
and after reordering we may assume it is a*. Solving for X; as before, we 
conclude that the set {Yi, Yo,..., Ye, Xk41,---,Xn} still spans V. Contin- 
uing by induction, we conclude that the vectors {Yi,..., Yn} span V, which 
means that {Yi,...,¥n+41} are dependent by Exercise A.1(b). 


Proposition A.2. If V is a finite-dimensional vector space, all bases of 
V contain the same number of elements. 
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Proof. If {£1,..., En} is a basis for V, then Lemma A.1 implies that any 
set containing more than n elements is dependent, so no basis can have more 
than n elements. Conversely, if there were a basis containing fewer than n 
elements, then Lemma A.1 would imply that {E1,...,E£,} is dependent, 
which is a contradiction. 


Because of the preceding proposition, it makes sense to define the di- 
mension of a finite-dimensional vector space to be the number of elements 
in a basis. 


Exercise A.2. Suppose V is a finite-dimensional vector space. 


(a) Show that every set that spans V contains a basis, and every linearly 
independent subset of V is contained in a basis. 


(b) IfS C V is a subspace, show that S is finite-dimensional and dim S < 
dim V, with equality if and only if S=V. 


An ordered basis of a finite-dimensional vector space is a basis endowed 
with a specific ordering of the basis vectors. For most purposes, ordered 
bases are more useful than bases, so we will assume without comment that 
each basis comes with a given ordering. We will denote an ordered basis by 
a notation such as (£y,...,E,) or (E;). 

If (£1,...,£,) is an (ordered) basis for V, each vector X € V has a 
unique expression as a linear combination of basis vectors: 


n 
X=)°X'E,;. 
i=l 
The numbers X°* are called the components of X with respect to this basis, 
and the ordered n-tuple (X',...,X"”) is called its basis representation. 


(Here is an example of a definition that requires an ordered basis.) 

The fundamental example of a finite-dimensional vector space is of course 
R” = R~x.--- x R, which we call n-dimensional Euclidean space. It is 
a vector space under the usual operations of vector addition and scalar 
multiplication. We will denote a point in R” by any of the notations x or 
(x') or (z!,...,@”); the numbers 2? are called the coordinates of x. They 
are also the components of x with respect to the standard basis (e1,...,€n), 
where e; = (0,...,1,...,0) is the vector with a 1 in the 7th place and zeros 
elsewhere. 

Notice that we always write the coordinates of a point (x!,...,2") € 
R” with upper indices, not subscripts as is usually done in linear algebra 
and calculus books, so as to be consistent with the Einstein summation 
convention (see Chapter 1). 

If S and T are subspaces of a vector space V, the notation S+T denotes 
the set of all vectors of the form X + Y, where X € S and Y € T. It is 
easily seen to be a subspace, and in fact is the subspace spanned by SUT. 
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IfS+T=V and SNT = {0}, V is said to be the direct sum of S and T, 
and we write V=S@T. 

If S is any subspace of V, another subspace TC V is said to be com- 
plementary to Sif V = S @T-. In this case, it is easy to check that every 
vector in V has a unique expression as a sum of an element of S plus an 
element of 7’. 


Exercise A.3. Suppose S and T are subspaces of a finite-dimensional vec- 
tor space V. 

(a) Show that SMT is a subspace of V. 

(b) Show that dim(S +7) = dim S + dim T — dim(SN 7). 

(c) IfV = S+T, show that V = S@T if and only if dim V = dim S+dimT. 


Exercise A.4. Let V be a finite-dimensional vector space. Show that 
every subspace S C V has a complementary subspace in V. In fact, if 
(£\,...,Fn) is any basis for V, show that there is some subset {i1,...,ix} 
of the integers {1,...,n} such that span(E;,,...,Ei,) is a complement to 
S. [Hint: Choose a basis (F\,...,Fm) for S, and apply Exercise A.1(d) to 
the ordered (m+ n)-tuple (Fi,..., Fm, F1,..., En). 


Suppose S C V is a subspace. For any vector x € V, the coset of S 
determined by z is the set 


a+S={xr+y:ye S}. 


A coset is also sometimes called an affine subspace of V parallel to S. The 
set V/S of cosets of S is called the quotient of V by S. 


Exercise A.5. Suppose V is a vector space and S is a subspace of V. 
Define vector addition and scalar multiplication of cosets by 


(c+ S)+(y+S)=(a+y)+S; 
c(a+S) =(cx) +S. 


(a) Show that the quotient V/S is a vector space under these operations. 
(b) If V is finite-dimensional, show that dim V/S = dim V — dim S. 


Linear Maps 


Let V and W be vector spaces. A map T: V > W is linear if T(aX +bY) = 
aT X + bTY for all vectors X,Y € V and all scalars a,b. The kernel of T, 
denoted by Ker T, is the set {X € V : TX = 0}, and its image, denoted by 
ImT or T(V), is {Y €W: Y =TX for some X € V}. 

One simple but important example of a linear map arises in the following 
way. Given a subspace S C V and a complementary subspace T’, there is a 
unique linear map 7: V — S defined by 


mX+Y)=X for X €S,Y eT. 
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This map is called the projection onto S with kernel T. 

A bijective linear map T: V — W is called an isomorphism. In this case, 
there is a unique inverse map T~!: W — V, and the following computation 
shows that T! is also linear: 


al '*X +bT1Y =T 'T(aT 1X +0TY) 
=T \(aTT-'X+bTT"Y) (by linearity of T) 
=T~'(aX + bY). 


For this reason, a bijective linear map is also said to be invertible. If there 
exists an isomorphism T’: V — W, then V and W are said to be isomorphic. 
Isomorphism is easily seen to be an equivalence relation. 


Example A.3. Let V be any n-dimensional vector space, and _ let 
(F,...,£,) be any ordered basis for V. Define a map FE: R" — V by 


BCs 20.98") =o Bites ae" By, 


Then FE is bijective, so it is an isomorphism, called the basis isomorphism 
determined by this basis. Thus every n-dimensional vector space is isomor- 
phic to R”. 


Exercise A.6. Let V and W be vector spaces, and suppose (£1,..., En) 
is a basis for V. For any n elements X1,...,Xn € W, show that there is a 
unique linear map T: V > W satisfying T(£;) = X; fori =1,...,n. 


Exercise A.7. Let S: V — W and T: W — X be linear maps. 


(a) Show that Ker S and ImS are subspaces of V and W, respectively. 
(b) Show that S is injective if and only if Ker S = {0}. 


(c) If S is an isomorphism, show that dimV = dim W (in the sense that 
these dimensions are either both infinite or both finite and equal). 


(d) If S and T are both injective or both surjective, show that To S has 
the same property. 


(e) IfToS is surjective, show that T is surjective; give an example to show 
that S may not be. 


(f) If ToS is injective, show that S is injective; give an example to show 
that T may not be. 


(g) IfV and W are finite-dimensional vector spaces of the same dimension 
and S is either injective or surjective, show that it is an isomorphism. 


Exercise A.8. Suppose V is a vector space and S is a subspace of V, and 
let 7: V — V/S denote the projection onto the quotient space. If T: V — W 
is a linear map, show that there exists a linear map T: V/S — W such that 
Ton=T if and only if S C KerT. 


Linear Algebra 409 


Now suppose V and W are finite-dimensional vector spaces with ordered 
bases (F,...,£,) and (Fi,..., Fim), respectively. If T: V > W is a linear 
map, the matrix of T with respect to these bases is the m x n matrix 

Ai... Al 


n 


A= (Al) = 


whose ith column consists of the components of TE; with respect to the 
basis (F}): 


TE, => AIF. 


j=l 


By linearity, the action of T’ on any vector X = )>, X'E; is then given by 


n n m 
a > 8) =S OS (AXE. 
i=1 i=1 j=1 
If we write the components of a vector with respect to a basis as a column 


matrix, then the matrix representation of Y = TX is given by matrix 
multiplication: 


y! AL... ALY (x! 


y™ Am... am} \xn 


n 


or, more succinctly, 
n 
a jyi 
Y= y A; xX". 
i=1 


It is straightforward to check that the composition of two linear maps is 
represented by the product of their matrices, and the identity tranformation 
of any n-dimensional vector space V is represented with respect to any basis 
by the n x n identity matrix, which we denote by [,,; it is the matrix with 
ones on the main diagonal and zeros elsewhere. 

The set M(m x n, R) of all m x n real matrices is easily seen to be a real 
vector space of dimension mn. In fact, by stringing out the matrix entries 
in a single row, we can identify it in a natural way with R™”. Similarly, 
because C is a real vector space of dimension 2, the set M(m x n,C) of 
m xX n complex matrices is a real vector space of dimension 2mn. 

Suppose A is an n X n matrix. If there is a matrix B such that AB = 
BA = I, then A is said to be invertible or nonsingular; it is singular 
otherwise. 
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Exercise A.9. Suppose A is a nonsingular matrix. 


(a) Show that there is a unique matrix B such that AB = BA = I,. This 
matrix is denoted by A~! and is called the inverse of A. 

(b) If A is the matrix of an invertible linear map T: V — W with respect 
to some bases for V and W, show that A is invertible and A~* is the 
matrix of T~! with respect to the same bases. 


Because R” comes endowed with the canonical basis (e;), we can unam- 
biguously identify linear maps from R” to R™ with m x n matrices, and 
we will often do so without further comment. 

In this book, we often need to be concerned with how various objects 
transform when we change bases. Suppose (£;) and (£;) are two bases 
for a finite-dimensional vector space V. Then each basis can be written 
uniquely in terms of the other, so there is an invertible matrix B, called 
the transition matriz between the two bases, such that 


(A.1) 


Now suppose V and W are finite-dimensional vector spaces and T’: V > 
W is a linear map. With respect to bases (£;) for V and (F;) for W, T 
is represented by some matrix A = (A?). If (E;) and (F;) are any other 
choices of bases for V and W, respectively, let B and C' denote the transition 
matrices satisfying (A.1) and 


F,= s CIF,, 
j=l 

F; a Lees 
w=1 


Then a straightforward computation shows that the matrix A representing 
T with respect to the new bases is related to A by 


Al = SCAB YE, 
k,l 
or, in matrix form, 
A=CAB". 


In particular, if T is a map from V to itself, we usually use the same basis 
in the domain and the range. In this case, if A denotes the matrix of T 
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with respect to (E;), and A with respect to (E;), we have 
A= BAB". 


If V and W are vector spaces, the set L(V, W) of linear maps from V to 
W is a vector space under the operations 


(S+T)X =SX+4TX; (cT)X = c(TX). 


If dim V = n and dim W = m, then any choice of bases for V and W gives 
usa map L(V,W) — M(mxn,R), by sending each linear map to its matrix 
with respect to the chosen bases. This map is easily seen to be linear and 
bijective, so dim L(V,W) = dim M(m x n,R) = mn. 

If T: V — W is a linear map between finite-dimensional spaces, the 
dimension of Im 7’ is called the rank of T, and the dimension of Ker T is 
called its nullity. 


Exercise A.10. Suppose V, W, X are finite-dimensional vector spaces, and 
let S: V — W and T: W = X be linear maps. 


(a) Show that S is injective if and only if rank S = dimV, and S is sur- 
jective if and only if rank S = dimW. 

(b) Show that rank(T o S) < rank S, with equality if and only if ImSnm 
Ker T = {0}. [Hint: Replace W by the quotient space W/ Ker T’] 

(c) Show that rank(T oS) < rankT, with equality if and only if ImS + 
KerT = W. 

(d) If S is an isomorphism, show that rank(ToS) = rankT, and if T is an 
isomorphism, show that rank(T o S) = rank S. 


The following theorem shows that, up to choices of bases, a linear map 
is completely determined by its rank together with the dimensions of its 
domain and range. 


Theorem A.4 (Canonical Form for a Linear Map). Suppose V and 
W are finite-dimensional vector spaces, and T: V — W is a linear map of 
rank r. Then there are bases for V and W with respect to which T has the 
following matriz representation (in block form): 


I, 0 
(6 9): 

Proof. Choose any bases (F\,..., F) for ImT and (4y,..., Ky) for KerT. 
Extend (F;) arbitrarily to a basis (F1,..., Fm) for W. By definition of the 
image, there are vectors £,...,E, € V such that TE; = F; fori =1,...,r. 
We will show that (F,...,F,, 1,...,K,) is a basis for V; once we know 
this, it follows easily that T has the desired matrix representation. 

Suppose first that >, a'E; + >>, b’K; = 0. Applying T to this equation 
yields }>;_, a’F; = 0, which implies that all the coefficients a’ are zero. 
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Then it follows also that all the b/s are zero because the Ks are indepen- 
dent. Therefore, the vectors (F1,..., E,, A1,...,K,) are independent. 

To show that they span V, let X € V be arbitrary. We can express 
TX €ImT as a linear combination of (Fi,..., F,): 


TX = s CF;. 
ab 


If we put Y = 0, c’E; € V, it follows that TY =TX,so Z=X-Ye 
Ker T. Writing Z = )0, d’ Kj, we obtain 


r k 
X=V¥+Z=S E+) 0K; 


i=1 j=l 


so (F,...,F,, K1,...,K,) do indeed span V. 


This theorem says that any linear map can be put into a particularly nice 
diagonal form by appropriate choices of bases in the domain and range. 
However, it is important to be aware of what the theorem does not say: If 
T: V — V isa linear map from a finite-dimensional vector space to itself, 
it may not be possible to choose a single basis for V with respect to which 
the matrix of T is diagonal. 

The next result is central in applications of linear algebra to smooth 
manifold theory; it is a corollary to the proof of the preceding theorem. 


Corollary A.5 (Rank-Nullity Law). Suppose T: V > W is any linear 
map. Then 


dim V = rankT + nullity T 
= dim(Im T) + dim(Ker T). 


Proof. The preceding proof showed that V has a basis consisting of k +r 
elements, where & = dim Ker T and r = dimImT. 


Suppose A is an m x n matrix. The transpose of A is the n x m matrix 
A? obtained by interchanging the rows and columns of A: (A?)J = Aj. (It 
can be interpreted abstractly as the matrix of the dual map; see Chapter 
3.) The matrix A is said to be symmetric if A = A? and skew-symmetric 
if A= —A’. 


Exercise A.11. If A and B are matrices of dimensions m x n and n x k, 
respectively, show that (AB)? = B7 A’. 


The rank of an m x n matrix A is defined to be its rank as a linear map 
from R” to R™. Because the columns of A, thought of as vectors in R™, are 
the images of the standard basis vectors under this linear map, the rank of 
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A can also be thought of as the dimension of the span of its columns, and 
is sometimes called its column rank. Analogously, we define the row rank 
of A to be the dimension of the span of its rows, thought of similarly as 
vectors in R”. 


Proposition A.6. The row rank of any matrix is equal to its column rank. 


Proof. Let A be an m xX n matrix. Because the row rank of A is equal to 
the column rank of A7, we must show that rank A = rank A’. 

Suppose the (column) rank of A is k. Thought of as a linear map from 
R” to R™, A factors through Im A as follows: 


R” a R”™ 


aN. GF 


Im A, 


where A is just the map A with its range restricted to Im A, and z is the 
inclusion of Im A into R™. Choosing a basis for the k-dimensional subspace 
Im A, we can write this as a matrix equation A = BC, where B and C are 
the matrices of 1 and A with respect to the chosen basis. Taking transposes, 
we find A? = C7 B’, from which it follows that rank AT < rank B’. Since 
BT is a k x m matrix, its column rank is at most k, which shows that 
rank A? < rank A. Reversing the roles of A and A’ and using the fact that 
(AT)? = A, we conclude that rank A = rank A’. 


Suppose A = (A?) is an m x n matrix. By choosing nonempty subsets 
{ii1,---, 4%} C {1,...,m} and {fi,..., jx} C {1,...,m}, we obtain a k x 1 
matrix whose entry in the pth row and qth column is Ay : 


ay ral 
Ag ae AG 
tk bis 
A tage 


Such a matrix is called a k x | minor of A. Looking at minors gives a 
convenient criterion for checking the rank of a matrix. 


Proposition A.7. Suppose A is anmxn matriz. Then rank A > k if and 
only if some k x k minor of A is nonsingular. 


Proof. We consider A as usual as a linear map from R” to R™. A sub- 
space of R” or R™ spanned by some subset of the standard basis vectors 
will be called a coordinate subspace. Suppose {%1,...,i,} C {1,...,m} and 
{ji,---,je} C {1,...,n}, and let M denote the k x k minor of A deter- 
mined by these subsets. If P C R” is the coordinate subspace spanned 
by (€;,,---,€;,), and Q C R™ is the coordinate subspace spanned by 
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(€:,,--+,€i,), it is easy to check that M is the matrix of the composite 
map 


PSR" AR" 3Q, 


where 2 is inclusion of P into R” and 7 is the coordinate projection of R™ 
onto Q. Thus to prove the proposition it suffices to show that rank A > k 
if and only if there are k-dimensional coordinate subspaces P and Q such 
that 70 Aov has rank k. 

One direction is easy. If there are such subspaces, then rank. = rank 7 = 
k, and k = rank(7o Ao) < min(k, rank A), which implies that rank A > k. 

Conversely, suppose that rank A = r > k. By Exercise A.4, there exists a 
coordinate subspace P C R” complementary to Ker A. Since dim P = n — 
dim Ker A = r > k by the rank-nullity law, we can choose a k-dimensional 
coordinate subspace P C P. Then A|p is injective, so if: P — R” denotes 
inclusion, it follows that A oz is injective and has rank k. 

Now let S Cc R™ be any coordinate subspace complementary to Im(Aov), 
and let Q be the coordinate subspace complementary to S' (i.e., the span of 
the remaining basis elements). If 7: R™ — Q is the projection onto Q with 
kernel S, then Im(A 0 t) intersects Ker 7 = S trivially, so rank(7 0 Aov) = 
rank Ao. =k by Exercise A.10(b). 


The Determinant 


There are a number of ways of defining the determinant of a square matrix, 
each of which has advantages in different contexts. The definition we will 
give here, while perhaps not pedagogically the most straightforward, is the 
simplest to state and fits nicely with our treatment of alternating tensors 
in Chapter 9. 

We let S,, denote the group of permutations of the set {1,...,n}, called 
the symmetric group on n elements. The properties of S,, that we will need 
are summarized in the following lemma; proofs can be found in any good 
undergraduate algebra text such as [Hun90] or [Her75]. A transposition is 
a permutation obtained by interchanging two elements and leaving all the 
others fixed. A permutation that can be written as a composition of an 
even number of transpositions is called even, and one that can be written 
as a composition of an odd number of transpositions is called odd. 


Lemma A.8 (Properties of the Symmetric Group). 


(a) Every element of S;, can be decomposed as a finite sequence of trans- 
positions. 


(b) For any o € Sp, the parity (evenness or oddness) of the number of 
transpositions in any decomposition of a as a sequence of transposi- 
tions is independent of the choice of decomposition. 


Linear Algebra 415 


(c) The map sgn: S;, > {+1} given by 


sen(a) = 


1 if o is even, 
—1 ifo is odd 


is a surjective group homomorphism, where we consider {+1} as a 
group under multiplication. 


Exercise A.12. Prove (or look up) Lemma A.8. 


If A = (A?) is an n x n (real or complex) matrix, the determinant of A 
is defined by the expression 


det A= S~ (sgn) Af! --- Ag”. (A.2) 
oESn 


For simplicity, we assume throughout this section that our matrices are 
real. The statements and proofs, however, hold equally well in the complex 
case. In our study of Lie groups we will also have occasion to consider 
determinants of complex matrices. 

Although the determinant is defined as a function of matrices, it is also 


useful to think of it as a function of n vectors in R”: If Ay,..., An € R”, 
we interpret det(A;,..., An) to mean the determinant of the matrix whose 
columns are (Aj,..., An): 

Pe cesu't Ae 


det(Ai,...,An) =det | : —— 
Av... A” 
It is obvious from the defining formula (A.2) that the function det: R” x 


---» x R” — R so defined is multilinear, which means that it is linear as a 
function of each vector when all the other vectors are held fixed. 


Proposition A.9 (Properties of the Determinant). Let A be annx 
nm matrix. 


(a) If one column of A is multiplied by a scalar c, the determinant is 
multiplied by the same scalar: 


det(Aj, sae ,cA;, oes ,An) = cdet(Ai, ar Ai, wee , An). 
(b) The determinant changes sign when two columns are interchanged: 


det Aiyotea A rene Aeon ay Ae 
= —det(Aj,...,Ap,...;Ag)-.-;4n).  (A.3) 
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(c) The determinant is unchanged by adding a scalar multiple of one 
column to any other column: 


det(Ai,...,Ai,..., Aj +cAi,..., An) 
= det(Ai,...,Aj,..., Aj..., An). 


(d) For any scalar c, det(cA) = c” det A. 
(e) If any two columns of A are identical, then det A = 0. 
(f) det A? = det A. 


(g) Ifrank A <n, then det A=0. 


Proof. Part (a) is part of the definition of multilinearity, and (d) follows 
immediately from (a). To prove (b), suppose p < g and let 7 € S,, be 
the transposition that interchanges p and q, leaving all other indices fixed. 
Then the left-hand side of (A.3) is equal to 


det(A1,...,Ag)-.-)Aps-+- An) = D> (sgna) Ag? AP... ACT. Ag” 


oESn 
= S- (sgn) Aq! «-- AI? .» ASP... Ag” 
cESn 
= So (seno)Aq7!.-. Agr" 
cESn 
=— S> (sgn(or)) Ag"! --- Ag” 
oESn 
=— So (sgnn) Aq: + An” 
NESn 
=— det(Aj, wie ,An), 


where the next-to-last line follows by substituting 7 = o7 and noting that 
7 runs over all elements of S,, as a does. Part (e) is then an immediate 
consequence of (b), and (c) follows by multilinearity: 


det(Ai,...,Ai,..., A; + cAj,..., An) 
=det(Ai,...,Aj,..., Aj..., An) + edet(A1,...,Ai,..., Ai..., An) 
=det(Ai,...,Ai,..., Aj... An) +0. 
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Part (f) follows directly from the definition of the determinant: 


det AT = S~ (sgna) A}, --- Az, 
cESn 
D (sgn) Ag, Ag, 
oESn 
S- (sgn o) Ag} . de (multiplication is commutative) 
oESn 
= S- (sen) A?) -.. Am™ (substituting 7 = 0~') 
NESn 
= det A. 


I 


I 


Finally, to prove (g), suppose rank A < n. Then the columns of A are 
dependent, so at least one column can be written as a linear combination of 
the others: Aj = )U,4; c'A;. The result then follows from the multilinearity 
of det and (e). 


The operations on matrices described in parts (a), (b), and (c) of the 
preceding proposition (multiplying one column by a scalar, interchanging 
two columns, and adding a multiple of one column to another) are called 
elementary column operations. Part of the proposition, therefore, describes 
precisely how a determinant is affected by elementary column operations. 
If we define elementary row operations analogously, the fact that the deter- 
minant of A’ is equal to that of A implies that the determinant behaves 
similarly under elementary row operations. 

Each elementary column operation on a matrix A can be realized by mul- 
tiplying A on the right by a suitable matrix, called an elementary matric. 
For example, multiplying the ith column by c is achieved by multiplying A 
by the matrix EF, that is equal to the identity matrix except for a c in the 
(i, 7) position: 


BU se BME ns AINE oe IG ns SO 
At ay an 0 0 1 
Al cA! Al 
=|A ol Aj 
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Observe that det E, = c. 


Exercise A.13. Show that interchanging two columns of a matrix is 
equivalent to multiplying on the right by a matrix whose determinant is 
—1, and adding a multiple of one column to another is equivalent to multi- 
plying on the right by a matrix of determinant 1. 


Exercise A.14. Suppose A is a nonsingular n x n matrix. 


(a) Show that A can be reduced to the identity I, by a sequence of ele- 
mentary column operations. 


(b) Show that A is equal to a product of elementary matrices. 


Elementary matrices form a key ingredient in the proof of the following 
theorem, which is arguably the deepest and most important property of 
the determinant. 


Theorem A.10. Jf A and B are n x n matrices, then 
det(AB) = (det A) (det B). 


Proof. If B is singular, then rank B < n, which implies that rank AB < 
n. Therefore both det B and det AB are zero by Proposition A.9(g). On 
the other hand, parts (a), (b), and (c) of Proposition A.9 combined with 
Exercise A.13 show that the theorem is true when B is an elementary 
matrix. If B is an arbitrary nonsingular matrix, then B can be written as 
a product of elementary matrices by Exercise A.14, and then the result 
follows by induction. 


Corollary A.11. If A is a nonsingular n x n matrix, then det(A~') = 
(det A)~?. 


Proof. Just note that 1 = det I, = det(AA~+) = (det A)(det A~?). 


Corollary A.12. A square matrix is nonsingular if and only if its deter- 
minant ts nonzero. 


Proof. One direction follows from Proposition A.9(g); the other from Corol- 
lary A.11. 


For actual computations of determinants, the formula in the following 
proposition is usually more useful than the definition. 


Proposition A.13 (Expansion by Minors). Let A be annxn matric, 
and for eachi, 7 let Mj denote the (n—1)x(n—1) minor obtained by deleting 
the ith column and jth row of A. For any fixed i between 1 and n inclusive, 


det A= 5 °(-1)'t7 AJ det Mj. (A.4) 
j=l 
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Proof. It is useful to consider first a special case: Suppose A is an n x n 
matrix that has the block form 


= & i) . (A.5) 


where B is an (n — 1) x (n — 1) matrix and C is a 1 x n row matrix. 
Then in the defining formula (A.2) for det A, the factor A?” is equal to 
1 when on = n and zero otherwise, so in fact the only terms that are 
nonzero are those in which o € S,_1, thought of as the subgroup of S, 
consisting of elements that permute {1,...,n—1} and leave n fixed. Thus 
the determinant of A simplifies to 


det A = x (sgna)Ag!--- Age) = det B. 
oESn-1 


Now let A be arbitrary, and fix i between 1 and n. For each j = 1,...,n, 
let X/ denote the matrix obtained by replacing the ith column of A by 
the basis vector e;. Since the determinant is a multilinear function of its 
columns, 


det A = det Ayyainy Apia) Al ej Assiy..ay Ap 
j=l 


Al det(A1,..., Ai—1,€j, Aita,---; An) (A.6) 


i 
ee 


al 


&. 
Il 


Al det X/. 


| 
[ele 


al 


&. 
Il 


On the other hand, by interchanging columns n — i times and then in- 
terchanging rows n — 7 times, we can transform x to a matrix of the 
form (A.5) with B = M/. Therefore, by the observation in the preceding 
paragraph, 


det X? = (—1)"-**"-4 det M? = (—1)*) det M2. 


Inserting this into (A.6) completes the proof. 


Formula (A.4) is called the expansion of det A by minors along the ith 
column. Since det A = det AT, there is an analogous expansion along any 
row. The factor (—1)’*? det M? multiplying A? in (A.4) is called the cofac- 
tor of A?, and is denoted by cof ?. 


Proposition A.14 (Cramer’s Rule). Let A be a nonsingular nxn ma- 


tric. Then A~* is equal to 1/(det A) times the transposed cofactor matrix 
of A: 


fe qc det M). (A.7) 


420 Appendix: Review of Prerequisites 


Proof. Let Bi denote the expression on the right-hand side of (A.7). Then 


a 1 ¥<& ae : 

> BAY = a det Mj. (A.8) 
j=l j=l 

When & = i, the summation on the right-hand side is precisely the expan- 
sion of det A by minors along the ith column, so the right-hand side of 
(A.8) is equal to 1. On the other hand, if k 4 7, the summation is equal to 
the determinant of the matrix obtained by replacing the ith column of A 
by the kth column. Since this matrix has two identical columns, its deter- 
minant is zero. Thus (A.8) is equivalent to the matrix equation BA = I, 
where B is the matrix (B/). Multiplying both sides on the right by A7?, 
we conclude that B= Aq}. 


A square matrix A = (A?) is said to be upper triangular if A? = 0 for 
j >t (ie., the only nonzero entries are on and above the main diago- 
nal). Determinants of upper triangular matrices are particularly easy to 
compute. 


Proposition A.15 (Determinant of an Upper Triangular Matrix). 
If A is an upper triangular n x n matrix, then the determinant of A is the 
product of its diagonal entries: 


det A= Al... A®. 


Proof. When n = 1, this is trivial. So assume the result is true for (n — 
1) x (n— 1) matrices, and let A be an upper triangular n x n matrix. In 
the expansion of det A by minors along the first column, there is only one 
nonzero entry, namely Aj det M}. By induction det Mj = A3--- A", which 
proves the proposition. 


Suppose X is an (m+ k) x (m+ k) matrix. We say X is block upper 
triangular if X has the form 
A B 
x=(4 8) ie 


for some matrices A, B,C of sizes m x m, mx k, and k x k, respectively. 


Proposition A.16. If X is the block upper triangular matrix given by 
(A.9), then det X = (det A)(det C). 


Proof. If A is singular, then clearly the columns of X are linearly depen- 
dent, which implies that det X = 0 = (det A)(det C’). So let us assume that 
A is nonsingular. 

Consider first the following special case: 
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Expanding by minors along the first column, an easy induction shows that 
det X = det C in this case. A similar argument shows that 


A O 
act (4 p) = acta, 


The general case follows from these two observations together with the 
factorization 


(oO c)=(o a)CG oe) (3 An’) 


noting that the last matrix above is upper triangular with ones along the 
main diagonal. 


Inner Products and Norms 


If V is a real vector space, an inner product on V isa map from V xV — V, 
usually written (X,Y) (X,Y), that is 


(i) SYMMETRIC: 
(X,Y) = (¥,X). 
(ii) BILINEAR: 
(aX +a’X',Y) =a(X,Y)+a'(X’,Y), 
(X,bY + UY’) = b(X,Y) +b'(X,Y’). 
(iii) POSITIVE DEFINITE: 


(X,X) >0 unless X = 0. 


A vector space endowed with a specific inner product is called an inner 
product space. The standard example is, of course, R” with its dot product 
or Euclidean inner product: 


nm 
Gy) =esy= > o'y. 
i=1 


Suppose V is an inner product space. For any X € V, the length of X 
is the number |X| = ./(X,X). A unit vector is a vector of length 1. The 
angle between two nonzero vectors X,Y € V is defined to be the unique 
6 € [0, 7] satisfying 


Two vectors X,Y € V are said to be orthogonal if the angle between them 
is 7/2, or equivalently if (X,Y) =0. 
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Exercise A.15. Let V be an inner product space. Show that the length 
function associated with the inner product satisfies 


|X| > 0, X EV, X £0; 
|cX| = |e| |X], CER XEeV; 
|X+Y|< |X|+]Y], X,YEV. 


Suppose V is a finite-dimensional inner product space. A_ basis 
(F,...,E,) of V is said to be orthonormal if each E; is a unit vector 
and £; is orthogonal to E; when i ¥ j. 


Proposition A.17 (The Gram-Schmidt Algorithm). Every finite- 
dimensional inner product space V has an orthonormal basis. In fact, if 
(Fy,..., Ey) is any basis of V, there is an orthonormal basis (E1,..., En) 
with the property that 


span(Fy,..., Bx) = span(E}, - Ee fork =1,...,n. (A.10) 


Proof. The proof is by induction on n = dim V. If n = 1, there is only one 
basis element E), and then E, = E;/|E£;| is an orthonormal basis. 

Suppose the result is true for inner product spaces of dimension n — 1, 
and let V have dimension n. Then W = span(Fj,..., En—1) is an (n — 1)- 
dimensional inner product space with the inner product restricted from 
V, so there is an orthonormal basis (Fy,...,En—1) satisfying (A.10) for 
k=1,...,n—1. Define E,, by 


eR (tee Sita Die OF a 


En = t=1 
En — ea (En, Bi) E; 
A computation shows that (Ey, ms ES) is the desired orthonormal basis. 


An isomorphism T: V — W between inner product spaces is called an 
isometry if it takes the inner product of V to that of W: 


(TX,TY) = (X,Y). 


Exercise A.16. Show that any isometry is a homeomorphism that pre- 
serves lengths, angles, and orthogonality, and takes orthonormal bases to 
orthonormal bases. 


Exercise A.17. If (£;) is any basis for the finite-dimensional vector space 
V, show that there is a unique inner product on V for which (E;) is ortho- 
normal. 


Exercise A.18. Suppose V is a finite-dimensional inner product space 
and FE: R” — V is the basis map determined by any orthonormal basis. 
Show that F is an isometry, where R” is endowed with the Euclidean inner 
product. 
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The preceding exercise shows that finite-dimensional inner product 
spaces are topologically and geometrically indistinguishable from the 
Euclidean space of the same dimension. Thus any such space automatically 
inherits all the usual properties of Euclidean space, such as compactness of 
closed and bounded subsets. 

If V is a finite-dimensional inner product space and $ C V is a subspace, 
the orthogonal complement of S in V is the set 


Vta{X eV: (X,Y) =0 for all Y € S.}. 


Exercise A.19. Let V be a finite-dimensional inner product space and 
SC V any subspace. Show that V = S@S*. 


Thanks to the result of the preceding exercise, for any subspace S$ of an 
inner product space V, there is a natural projection 7: V — S with kernel 
S+. This is called the orthogonal projection of V onto S. 

A norm on a vector space V is a function from V to R, written X +> |X], 
satisfying 


(i) Positivity: |X| > 0 for every X € V, and |X| = 0 if and only if 
X =0. 


(ii) HOMOGENEITY: |cX| = |c| |X| for every ce Rand X EV. 
(iii) TRIANGLE INEQUALITY: |X + Y| < |X|+|Y| for all X,Y eV. 


A vector space together with a specific choice of norm is called a normed 
linear space. Exercise A.15 shows that the length function associated with 
any inner product is a norm. 

If V is a normed linear space, then for any p € V and any r > 0 we 
define the open ball and closed ball of radius r around p, denoted by B,.(p) 
and B,(p), respectively, by 


B,(p) ={x#eEV:|x—p| <r}, 
B,(p) ={x EV: |x —p| <r}. 


Given a norm on V, the function d(X,Y) = |X —Y| is a metric, yielding 
a topology on V called the norm topology. The set of all open balls is easily 
seen to be a basis for this topology. Two norms | - |; and | - |2 are said to 
be equivalent if there are positive constants c,C' such that 


|X |1 < |X |o < C|X|1 for all X EV. 


Exercise A.20. Show that equivalent norms on a vector space V deter- 
mine the same topology. 


Exercise A.21. Show that any two norms on a finite-dimensional vector 
space are equivalent. [Hint: first choose an inner product on V, and show 
that the unit ball in any norm is compact with respect to the topology 
determined by the inner product.] 
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If V and W are normed linear spaces, a linear map T: V — W is said 
to be bounded if there exists a positive constant C’ such that 


ITX| < C|X| for all X EV. 


Exercise A.22. Show that a linear map between normed linear spaces is 
bounded if and only if it is continuous. 


Exercise A.23. Show that every linear map between finite-dimensional 
normed linear spaces is continuous. 


The vector space M(m x n,R) of m x n real matrices has a natural 
Euclidean inner product, obtained by identifying a matrix with a point in 
Re: 


A-B=)_ A‘Bi. 


tJ 


This yields a Euclidean norm on matrices: 


[Al = 1/5, (42. (A.11) 


Whenever we use a norm on a space of matrices, it will always be assumed 
to be this Euclidean norm. 


Exercise A.24. For any matrices A € M(m x n,R) and B € M(nx k,R), 
show that 


|AB| < |A| |B]. 


Calculus 


In this section, we summarize the main results from multivariable calculus 
and real analysis that are needed in this book. For details on most of the 
ideas touched on here, you can consult [Rud76] or [Apo74]. 

If U Cc R” is any subset and F': U — R™ is any map, we write the 
components of F(x) as F(x) = (F'(a),...,F™(a)); this defines n functions 
F',...,F": U — R called the component functions of F. Note that F* = 
to F, where 7*: R™ — R is the projection on the ith coordinate: 


For maps between Euclidean spaces, there are two separate but closely 
related types of derivatives: partial derivatives and total derivatives. We 
begin with partial derivatives. 
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Partial Derivatives 


Suppose U C R” is open and f: U — R is any real-valued function. For 
any a = (al,...,a") € U and any j = 1,...,n, the jth partial derivative 
of f at a is defined by differentiating with respect to 27 and holding the 
other variables fixed: 

ws Pla ett Ay eG) f(a+ he;) 
onl Ss = =] 

era a h =) ho 


if the limit exists. For a vector-valued function F': U — R™, the partial 
derivatives of F are defined simply to be the partial derivatives OF’ /Ox/ of 
the component functions of F. The matrix (OF"/0x’) of partial derivatives 
is called the Jacobian matriz of F’. 

If F: U — R™ is a map for which each partial derivative exists at each 
point in U and the functions 0F*/02): U — R so defined are all continuous, 
then F is said to be of class C! or continuously differentiable. If this is the 
case, we can differentiate the functions 0F"/Ox/ to obtain second-order 
partial derivatives 


OP bi (Oe 
Ox Oxi ~~ Oak \ Aad}? 


if they exist. Continuing this way leads to higher-order partial derivatives— 
the partial derivatives of F' of order k are the partial derivatives of those 
of order k — 1, when they exist. 

In general, for k > 0, a function F': U — R™ is said to be of class 
C* or k times continuously differentiable if all the partial derivatives of 
F of order less than or equal to k& exist and are continuous functions on 
U. (Thus a function of class C° is just a continuous function.) A function 
that is of class C* for every k > 0 is said to be of class C™, smooth, or 
infinitely differentiable. Because existence and continuity of derivatives are 
local properties, clearly F is C1 (or C* or smooth) if and only if it has that 
property in a neighborhood of each point in U. 

We will often be most concerned with real-valued functions, that is, func- 
tions whose range is R. If U C R” is open, the set of all real-valued functions 
of class C* on U is denoted by C*(U), and the set of all smooth real-valued 
functions by C*(U). By virtue of the following exercise, C°°(U) is a vector 
space under pointwise addition and multiplication by constants: 


(f + 9)(@) = f(a) + g(x) 
(cf)(@) = e(f(2)). 


In fact, it is also a ring, with multiplication defined pointwise: 


(f9)(x) = F(a) g(2). 
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Exercise A.25. Let U C R” be an open set, and suppose f,g: U — R” 
are smooth. 


(a) Show that f +g is smooth. 
(b) Show that fg is smooth. 


(c) If g never vanishes on U, show that f/g is smooth. 


The following important result shows that, for most interesting maps, 
the order in which we take partial derivatives is irrelevant. For a proof, see 
[Rud76]. 


Proposition A.18 (Equality of Mixed Partial Derivatives). If U 
is an open subset of R” and F: U > R™ is a map of class C?, then 
the mixed second-order partial derivatives of F do not depend on the order 
of differentiation: 


OF OF 


Oxi Oxk ~~ Oxk Oxi” 


Corollary A.19. If F: U — R™ is smooth, then mixed partial derivatives 
of any order are independent of the order of differentiation. 


Another important property of smooth functions is that integrals of 
smooth functions can be differentiated under the integral sign. A precise 
statement is given in the next theorem; this is not the best that can be 


proved, but it is more than sufficient for our purposes. For a proof, see 
[Rud76]. 


Theorem A.20 (Differentiation Under an Integral Sign). Let U c 
R” be an open set, a,b € R, and let f: U x [a,b] — R be a continuous 
function such that the partial derivative Of /Ot: U x [a,b] — R is also 
continuous. Define F: U — R by 


b 
F(a)= | f(a, t) dt. 


Then F is of class C', and its partial derivatives can be computed by dif- 
ferentiating under the integral sign: 


ON Of 
Ont) ~ q Oxt 


(a, t) dt. 


Theorem A.21 (Taylor’s Formula with Remainder). Let U Cc R” 
be a convex open set, and suppose f is a smooth real-valued function on 
U. For any integer m > 0, any a € U, and all v € R” small enough that 
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a+veU, 
fa+vy =>) ,> Tag aa“ a)uit yt 
k=0 721,..-,tK 
1 1 oe omtls ‘ ; 
eo = [ ie t) Aha Deda (a+ tv)v™ ---v'™+1 dt, (A.12) 
UL geeey tm+1 


where each index i; runs from 1 to n. 


Proof. The proof is by induction on m. When m = 0, it follows from the 
fundamental theorem of calculus and the chain rule: 


f(atv)- o=[ 3 af (at tv) \dt = ae -(a + tu)v" dt. 


So suppose the formula holds for some m > 0. To prove it for m+ 1, we 
integrate by parts in (A.12), with 


amt 
U= S- f (a+ tu)ut ---yimtt, 


Ue oie tm+1 
x ax 


m2 2 5 


- + Ogimt2 


1 1 ah ots é : p 
S- | (1 -#) Amica (a+ tu)u -+-u'™+! dt 


1 1 “hf ont? f , y 
Aan oe — ty)u"t ..-y'mt2 dt 
+f (m+ 1)! ( 2 Oa + Ogimt2 Cea . 


a: S- 1 ont! f (a)v" Poe yimti 
- (m+ 1)! Oatt - ++ Oatmti 


U1 yeystm4t1 


+> i 1 (y — ym ours (a + tu)u"! «+. dt 
5 3 00) 0 lla % 
9 (m+ 1)! ait .-- Agpimea ° 
’ +2 


Inserting this into (A.12) completes the proof. 
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Corollary A.22 (First-Order Taylor Formula). With f, a, and v as 
above, 


Fla+v) = fla) + >> SE (ao! + aioe’ 


for some smooth functions g1,...,;Gn defined on U. 


The notation O(|z|*) is used to denote any function G(x) defined on a 
neighborhood of the origin in R" which satisfies |G(x)| < C|z|* for some 
constant C’' and all sufficiently small x. 


Corollary A.23 (Second-Order Taylor Formula). With f, a, and v 
as above, 


| “ of | ) ty 1 . Of 
Ox? mae 2 Ox? Oxi 


i=1 


fla+v) = fla) (a)v'v? + O(\v|*). 


Exercise A.26. Prove the previous two corollaries. 


We will sometimes need to consider smooth maps on subsets of R” that 
are not open. If A C R” is any subset, a map Ff’: A — R™ is said to be 
smooth if it extends to a smooth map U — R* on some open neighborhood 
U of A. 


The Total Derivative 


For maps between (open subsets of) finite-dimensional vector spaces, there 
is another very important notion of derivative, called the total derivative. 

Let V,W be finite-dimensional vector spaces, which we may assume to 
be endowed with norms. If U C V is an open set, a map F’: U — W is said 
to be differentiable at a € U if there exists a linear map L: V — W such 
that 


_. F(a+v)— F(a) — Lv 
lim 
v0 |v| 


=0. (A.13) 


Because all norms on a finite-dimensional vector space are equivalent, this 
definition is independent of the choices of norms on V and W. 


Exercise A.27. Suppose F': U — W is differentiable. Show that the lin- 
ear map L satisfying (A.13) is unique. 


If F is differentiable at a, the linear map L satisfying (A.13) is denoted 
by DF(a) and is called the total derivative of F at a. Condition (A.13) can 
also be written 


F(a+v) = F(a)+ DF(a)u+ Riv), (A.14) 
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where the remainder term R(v) satisfies R(v)/|vu| — 0 as uv — 0. One 
thing that makes the total derivative so powerful is that it makes sense 
for arbitrary finite-dimensional vector spaces, without the need to choose 
a basis or even a norm. 


Exercise A.28. Suppose V,W are finite-dimensional vector spaces; U C 
V is an open set;a € U; F,G: U + W; and f,g: U — R. 


(a) If F is differentiable at a, show that F’ is continuous at a. 


(b) If F and G are differentiable at a, show that F'+G is also, and 
D(F + G)(a) = DF (a) + DG(a). 
(c) If f and g are differentiable at a € U, show that fg is also, and 
D(fg9)(a) = f(a)Dg(a) + g(a) DF (a). 


(d) If f is differentiable at a and f(a) 4 0, show that 1/f is differentiable 
at a, and 


D(1/f)(a) = —(1/f(a)”)D f(a). 


Proposition A.24 (The Chain Rule for Total Derivatives). Sup- 
pose V,W,X are finite-dimensional vector spaces, U C V and U CW are 
open sets, and F: U > U and G:U = X are maps. If F is differentiable 
ata €U and G is differentiable at F(a) € U, then Go F is differentiable 
at a, and 


D(Go F)(a) = DG(F(a)) o DF(a). 
Proof. Let A= DF(a) and B = DG(F(a)). We need to show that 


ine Go F(a+v) —Go F(a) — BAv 


v—0 |v| 


=0. (A.15) 


We can rewrite the quotient in (A.15) as 


G(F(a+v)) — G(F(a)) — B(F(a+ v) — F(a)) 


|v| 
Sng Gee oS) eer 


lo 


Asv— 0, F(a+v) — F(a) > 0 by continuity of F’. Therefore the differen- 
tiability of G at F(a) implies that, for any ¢ > 0, we can make 


|IG(F(a+ v)) — G(F(@)) — B(F(a + v) — F(a))| < elF(a+ v) — F(a)| 
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as long as |v| lies in a small enough neighborhood of 0. Thus for |v| small 
(A.16) is bounded by 


ia esiOE (ca eed e2 ve 


Restricting |v| to an even smaller neighborhood of 0, we can ensure that 
|F(a+v) — F(a) — Av| < ev, 


because of the differentiability of F at a. Since the linear map B is contin- 
uous, it follows that (A.17) can be made as small as desired by choosing 
|v| small enough, thus completing the proof. 


Now let us specialize to the case of maps between Euclidean spaces. 
Suppose U Cc R” is open and F: U — R” is differentiable at a € U. 
As a linear map between Euclidean spaces R” and R™, DF(a) can be 
identified with an m x n matrix. The next lemma identifies that matrix as 
the Jacobian of F’. 


Lemma A.25. Let U C R” be open, and suppose F: U — R™ is differ- 
entiable at a © U. Then all of the partial derivatives of F at a exist, and 
DF(a) is the linear map whose matrix is the Jacobian of F at a: 


pr) = (SS). 


Proof. Let B = DF(a). Applying the definition of differentiability with 
v = te;, we obtain 


j .) — Fi(a) —tBi 
jae Fi(a+ te;) — F4(a) — tB . 
t—0 |t| 


Considering t > 0 and t < 0 separately, we find 


Fi(a + te;) — Fi(a) — tBi 


0 = lim 
t\0 t 
Fi(a+te;) — Fi ‘ 
sig aate aK) Bi. 
t\0 t 
Jj .) — Fi(q) —tBi 
i im 2 6a t ti) Fi(a) — tB: 
t70 t 
J )_ fi : 
*: (tim A) #9) Bh). 
t70 t 


Combining these results, we obtain OF! /dx'(a) = B/ as claimed. 
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Exercise A.29. Suppose U C R” is open. Show that a map F': U — R™ 
is differentiable at a € U if and only if each of its component functions 
F',...,F™ is differentiable at a, and 
DF"(a) 
DF(a) = : 
DF (a) 


The next proposition gives the most important sufficient condition for 
differentiability; in particular, it shows that all of the usual functions of 
elementary calculus are differentiable. For a proof, see [Rud76]. 


Proposition A.26. Let U C R” be open. If F: U > R™ is of class C', 
then it is differentiable at each point of U. 


Exercise A.30. If T: R” — R”™ is a linear map, show that T is differen- 
tiable at each a € R”, with DT (a) =T. 


In the case of maps between Euclidean spaces, the chain rule can be 
rephrased in terms of partial derivatives. 


Corollary A.27 (The Chain Rule for Partial Derivatives). Let 
U Cc R® and U C R™ be open sets, and let x = (x,...,2") denote the 


coordinates on U and y = (y',...,y™) those on U. 


(a) Any composition of C’ functions F: U > U andG: U > R? is again 
of class C1, with partial derivatives given by 


AGoF),. oa OF 
Far (2) = ss ae Poa Oe 


(b) If F and G are smooth, then Go F is smooth. 


Exercise A.31. Prove Corollary A.27. 


From the chain rule and induction one can derive formulas for the higher 
partial derivatives of a composite map as needed, provided the maps in 
question are sufficiently differentiable. 

Now suppose f: U — R is a smooth real-valued function on an open 
set U C R", and a € U. For any vector v € R”, we define the directional 
derivative of f in the direction v at a to be the number 


Dyf(a)= —|  f(attv). (A.18) 
t=0 


(This definition makes sense for any vector v; we do not require v to be a 
unit vector as one sometimes does in elementary calculus.) 
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Since D, f(a) is the ordinary derivative of the composite map t i a+ 
tur f(a+tv), by the chain rule the directional derivative can be written 
more concretely as 


Def (a) = Sov! of (d= Deau: 


i=1 


The next result gives an important estimate for the local behavior of a 
C* function in terms of its derivative. If U C R” is any subset, a function 
F: U — R” is said to be Lipschitz continuous on U if there is a constant 
C such that 


|F(2) — F(y)|< Cla—y| for all z,y € U. (A.19) 


Any such C is called a Lipschitz constant for F. 


Proposition A.28 (Lipschitz Estimate for C! Functions). Let 
U Cc R” be an open set, and let F: U — R™ be of class C!. Then F is 
Lipschitz continuous on any closed ball B C U, with Lipschitz constant 
M = supyep |DF(a)| (where the norm on DF(a) is the Euclidean norm 
(A.11) for matrices). 


Proof. Let a,b € B be arbitrary, and define v = b—a and G(t) = F(a+tv). 
Because B is convex, G is defined and of class C1 for t € [0,1]. The mean- 
value theorem of one-variable calculus implies 


G(1) — G(0) = G'(to)(1 — 0) = G'(to) 


for some to € [0, 1]. Using the definition of G, the chain rule, and Exercise 
A.24, this yields 


|F(b) — F(@)| = |DF(a + tov)o| 
< |DF(a + tov)|lo| 
< M\|b-al, 


which was to be proved. 


Multiple Integrals 


In this section, we give a brief review of some basic facts regarding multiple 
integrals in R”. For our purposes, the Riemann integral will be more than 
sufficient. Readers who are familiar with the theory of Lebesgue integration 
are free to interpret all of our integrals in the Lebesgue sense, because the 
two integrals are equal for the types of functions we will consider. For 
more details on the aspects of integration theory described here, you can 
consult nearly any text that treats multivariable calculus rigorously, such 
as [Apo74, Fle77, Mun91, Rud76, Spi65). 
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A rectangle in R” (also called a closed rectangle) is a product set of 
the form [a',b'] x --» x [a",b”], for real numbers a’ < b'. Analogously, 
an open rectangle is the interior of a closed rectangle, a set of the form 
(at, b!) x---x (a, b"). The volume of a rectangle A of either type, denoted 
by Vol(A), is defined to be the product of the lengths of its component 
intervals: 


Vol(A) = (b! — a!)---(b" — a”). 


A rectangle is called a cube if all of its side lengths |b; — a;| are equal. 

A partition of a closed interval [a,b] is a finite set P = {ao,...,ax} 
of real numbers such that a = agp < ay < --: < ay = b. Each of the 
intervals [a;_1,a@;] for i = 1,...,k is called a subinterval of the partition. 
Similarly, a partition P of a rectangle A = [a',b!] x --- x [a”,b"] is an n- 
tuple (P,,...,P,), where each P; is a partition of [a’, b']. Each rectangle of 
the form J, x---xI,, where I; is a subinterval of P;, is called a subrectangle 
of P. Clearly A is the union of all the subrectangles in any partition, and 
distinct subrectangles overlap only on their boundaries. 

Suppose A C R” is a closed rectangle and f: A — R is a bounded 
function. For any partition P of A, we define the lower sum of f with 
respect to P by 


L(f,P) = > (inf f) Vol(Rj), 


where the sum is over all the subrectangles R; of P. Similarly, the upper 
sum is 


U(f,P) =) _(sup f) Vol(R;). 
i: 
The lower sum with respect to P is obviously less than or equal to the 
upper sum with respect to the same partition. In fact, more is true. 
Lemma A.29. Let A C R” be a rectangle, and let f: A > R be a bounded 
function. For any pair of partitions P and P' of A, 
TGP) SUE B)s 


Proof. Write P = (P,,...,P,) and P’ = (P{,...,P*), and let Q be the 
partition Q = (P, UP{,...,P, UP’). Each subrectangle of P or P’ is a 
union of finitely many subrectangles of Q. An easy computation shows 


L(f,P) < L(f,Q) < UF,Q) < UF, P’), 


from which the result follows. 
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The lower integral of f over A is 
[is dV = sup{L(f, P): P is a partition of A}, 
and the et is 
[i dV = inf{U(f, P): P is a partition of A}. 


Clearly both numbers exist because f is bounded, and Lemma A.29 implies 
that the lower integral is less than or equal to the upper integral. 

If the upper and lower integrals of f are equal, we say that f is (Riemann) 
integrable, and their common value, denoted by 


[tev 


is called the integral of f over A. The “dV” in this notation, like the “dz” 
in the notation for single integrals, does not have any meaning in and of 
itself; it is just a “closing bracket” for the integral sign. Other notations in 
common use are 


ss or [feet ao" or ff et det de 
A A A 


In R?, the symbol dV is often replaced by dA. 

There is a simple criterion for a bounded function to be Riemann inte- 
grable. It is based on the following notion. A subset A C R” is said to have 
measure zero if for any 6 > 0, there exists a countable cover of A by open 
cubes {C;} such that 5); Vol(C;) < 6. (For those who are familiar with the 
theory of Lebesgue measure, this is equivalent to the condition that the 
Lebesgue measure of A is equal to zero.) 


Lemma A.30 (Properties of Measure Zero Sets). 


a) A countable union of sets of measure zero in R" has measure zero. 


b 


Any subset of a set of measure zero in R” has measure zero. 


c) A set of measure zero in R” can contain no open set. 


(a) 
(b) 
(¢) 
(d) 


Any proper affine subspace of R” has measure zero in R”. 
Exercise A.32. Prove Lemma A.30. 


Part (d) of this lemma illustrates that having measure zero is a property 
of a set in relation to a particular Euclidean space containing it, not of a 
set in and of itself—for example, an open interval in the x-axis has measure 
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zero as a subset of R*, but not when considered as a subset of R!. For this 
reason, we sometimes say a subset of R” has n-dimensional measure zero 
if we wish to emphasize that it has measure zero as a subset of R”. 

The following proposition gives a sufficient condition for a function to be 
integrable. It shows, in particular, that every bounded continuous function 
is integrable. 


Proposition A.31 (Lebesgue’s Integrability Criterion). Let A C 
R” be a rectangle, and let f: A — R be a bounded function. If the set 


S={xeA: f is not continuous at x} 
has measure zero, then f is integrable. 


Proof. Let ¢ > 0 be given. By definition of measure zero sets, S can be 
covered by a countable collection of open cubes {C;} with total volume 
less than e. 

For each point ¢g € A\ S, since f is continuous at q, there is a cube Dg 
centered at q such that |f(a) — f(q)| < ¢ for all c € D,N A. This implies 
supp, f —infp, f < 2e. 

The collection of all open cubes of the form Int C; or Int Dg is an open 
cover of A. By compactness, finitely many of them cover A. Let us relabel 
these cubes as {C),...,Cx, Di,...,Di}. Replacing each C; or D; by its 
intersection with A, we may assume that each C; and each D; is a rectangle 
contained in A. 

Since there are only finitely many rectangles {C;, D;}, there is a partition 
P with the property that each C; or D; is equal to a union of subrectangles 
of P. (Just use the union of all the endpoints of the component intervals of 
the rectangles C;; and D; to define the partition.) We can divide the sub- 
rectangles of P into two disjoint sets C and D such that every subrectangle 
in @ is contained in C; for some i, and every subrectangle in D is contained 
in D; for some j. Then 


UG Py iP) 
= d_(sup f) Vol(Ri) — ) [Ginf f) Vol(Rs) 


uv 


= >o (sup f — inf f) Vol(Ri) + (sup f —inf f) Vol(Ri) 


RiEe - R,ED a 
< (sup f — inf f) S~ Vol(Ri) + 2e S> Vol(R;) 
A REC R,ED 


< (sup f — inf f)e + 2e Vol(A). 
A A 


It follows that 


[swf sav < (sup f — inf f)e + 2¢ Vol(A), 
A JA A A 
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which can be made as small as desired by taking ¢ sufficiently small. This 
implies that the upper and lower integrals of f must be equal, so f is 
integrable. 


Remark. In fact, the Lebesgue criterion is both necessary and sufficient for 
Riemann integrability, but we will not need that. 


Now suppose D C R” is any bounded set, and f: D — R is a bounded 
function. Let A be any rectangle containing D, and define fp: A — R by 


fo(@)= ta ne D. (A.20) 


If the integral 


| fodV (A.21) 
A 


exists, f is said to be integrable over D, and the integral (A.21) is denoted 
by Jp f dV and called the integral of f over D. It is easy to check that the 
value of the integral does not depend on the rectangle chosen. 

In practice, we will be interested only in integrals of bounded continuous 
functions. However, since we will sometimes need to integrate them over 
domains other than rectangles, it is necessary to consider also integrals of 
discontinuous functions such as the function fp defined by (A.20). The 
main reason for proving Proposition A.31 is that it allows us to give a 
simple description of domains on which all bounded continuous functions 
are integrable. 

A subset D Cc R” will be called a domain of integration if D is bounded 
and 0D has n-dimensional measure zero. It is easy to check (using Lemma 
A.30) that any set whose boundary is contained in a finite union of proper 
affine subspaces is a domain of integration, and finite unions and inter- 
sections of domains of integration are again domains of integration. Thus, 
for example, any finite union of open or closed rectangles is a domain of 
integration. 


Proposition A.32. If D C R” is a domain of integration, then every 
bounded continuous function on D is integrable over D. 


Proof. Let f: D — R be bounded and continuous, and let A be a rectangle 
containing D. To prove the theorem, we need only show that the function 
fp: A — R defined by (A.20) is continuous except on a set of measure 
zero. 

If « € Int D, then fp = f on a neighborhood of x, so fp is continous at 
x. Similarly, if ¢ € A\ D, then fp = 0 on a neighborhood of x, so again 
f is continuous at x. Thus the set of points where fp is discontinuous is 
contained in OD, and therefore has measure zero. 
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Of course, if D is compact, then the assumption that f is bounded in 
the preceding proposition is superfluous. 
If D is a domain of integration, the volume of D is defined to be 


Vol(D) = I 1dV. 


The integral on the right-hand side is often abbreviated [ pd. 
The next two propositions collect some basic facts about volume and 
integrals of continuous functions. 


Proposition A.33 (Properties of Volume). Let D C R” be a domain 
of integration. 


(a) Vol(D) > 0, with equality if and only if D has measure zero. 
(b) If Di,..., Dz are domains of integration whose union is D, then 
Vol(D) < Vol(D1) +---+ Vol(Dx), 
with equality if and only if Di D; has measure zero for each i, j. 


(c) If D, is a domain of integration contained in D, then Vol(D1) < 
Vol(D), with equality if and only if D\ Dy has measure zero. 


Proposition A.34 (Properties of Integrals). Let D C R” be a do- 
main of integration, and let f,g: D— R be continuous and bounded. 


(a) For any a,be€E R, 


[tatrrav=af pave [ gav. 


(b) If D has measure zero, then J, f dV = 0. 


(c) If Di,...,Dx are domains of integration whose union is D and whose 
pairwise intersections have measure zero, then 


[ fav = fav #o04-] fav. 
D Dy Dy 


(d) If f > 0 on D, then J,, fdV > 0, with equality if and only if f = 0 
on Int D. 


(e) (inf f) Vol(D) < i: faV < (sup f) Vol(D). 


Wn [rays f iniav. 
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Exercise A.33. Prove Propositions A.33 and A.34. 


There are two more fundamental properties of multiple integrals that we 
will need. The proofs are too involved to be included in this summary, but 
you can look them up in the references listed at the beginning of this section 
if you are interested. Each of these theorems can be stated in various ways, 
some stronger than others. The versions we give here will be quite sufficient 
for our applications. 


Theorem A.35 (Change of Variables). Suppose D and E are com- 
pact domains of integration in R”, and G: D — E is a continuous map 
such that Glmtp: Int D — Int E is a bijective C! map with C! inverse. 
For any bounded continuous function f: ER, 


aGi 
[tv=f[ir5ce act (SS) | av 


Theorem A.36 (Evaluation by Iterated Integration). Suppose 
E Cc R®” is a compact domain of integration and go,gi:: E — R are 
continuous functions such that go < gi everywhere on E. Let D Cc R"*+ 
be the subset 


D={(2',...,2",y) ER"): 2 € E and go(z) <y < m(z)}. 


Then D is a domain of integration, and 


[rv=f ( J i jee ay) av. 


Of course, there is nothing special about the last variable in this formula; 
an analogous result holds for any domain D that can be expressed as the 
set on which one variable is bounded between two continuous functions of 
the remaining variables. 

If the domain F in the preceding theorem is also a region between two 
graphs, the same theorem can be applied again to E. In particular, the 
following formula for an integral over a rectangle follows easily by induction. 


Corollary A.37. Let A = [a,b] x --- x [a",b”] be a closed rectangle in 
R”, and let f: A— R be continuous. Then 


[rw fr (--( PT Fee at)! +) an 


and the same is true if the variables in the iterated integral on the right- 
hand side are reordered in any way. 
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Sequences and Series of Functions 


We conclude with a summary of the most important facts about sequences 
and series of functions on Euclidean spaces. 

Let S Cc R” be any subset, and for each integer i > 1 suppose that 
fi: S — R™ is a function on S. The sequence {f;} is said to converge 
pointwise to f: S — R™ if for each a € S and each € > 0, there exists an 
integer N such that i > N implies | f;(a) — f(a)| < ¢. Pointwise convergence 
is denoted simply by f; — f. The sequence is said to converge uniformly 
to f if N can be chosen independently of the point a: for each ¢ > 0 there 
exists N such that i > N implies |f;(a) — f(a)| < € for every a € U. We 
will usually indicate uniform convergence by writing “f; — f uniformly.” 


Theorem A.38 (Properties of Uniform Convergence). Let S Cc 
R”, and let fj: S — R™ for each integer i > 1. 


(a) If each f; is continuous and f; > f uniformly, then f is continuous. 


(b) If each f; is continuous and f; > f uniformly, then for any closed 
domain of integration DC S, 


lim fav =f fav. 
D D 


41—Co 


(c) If S is open, each f; is of class C!, f; — f pointwise, and the sequence 
{Of;/Ox1} converges uniformly on S as i > ov, then Of /Ox) exists 
on S and 


OF. lim Ofi 


ae ee 1 7 
Oxi isoo Oxd 


For a proof, see [Rud76]. An infinite series of functions }>7°, fi on S.C R” 
is said to converge pointwise to a function g if the corresponding sequence 
of partial sums converges pointwise: 


M 


g(x) = lim S- fila) for alla ec S. 


M-~w- 
i=0 
The series is said to converge uniformly if the partial sums converge uni- 
formly. 


Proposition A.39 (Weierstrass M-test). Suppose S C R” is any sub- 
set, and f;: S — R*® are functions. If there exist positive real numbers M; 
such that supg |fil < Mi and ae M; converges, then Dee fi. converges uni- 
formly on A. 


Exercise A.34. Prove Proposition A.39. 
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is free, 160 
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local one-parameter, 313 
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left, 145 
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proper, 147 
of a discrete group, 157 
properly discontinuous, 158 
right, 146 
smooth, 146 


transitive, 147 
adapted 
chart, 154 
orthonormal frame, 192, 262 
addition, vector, 403 
adjoint matrix, 150 
Ado’s theorem, 398 
affine 
singular 
simplex, 292 
subspace, 407 
algebra, Lie, 371 
induced homomorphism, 
379 
of a Lie group, 373 
product, 372 
along a submanifold, 123 
Alt (alternating projection), 205 
Alt convention, 212 
alternating 
projection, 205 
tensor, 202 
and orientation, 231 
basis for space of, 207 
elementary, 206 
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alternative definitions of the 
tangent space, 55 
angle, 186, 421 
function, 14 
antisymmetry of bracket, 371 
approximation 
linear 
and the differential, 74 
approximation linear, 41 
approximation theorem 
Whitney, 138 
on manifolds, 142 
atlas, 8 
for a manifold with 
boundary, 20 
smooth, 8 
complete, 9 
maximal, 9 
autonomous system of ODEs, 
326 


backward reparametrization, 81 


Baker-Campbell-Hausdorff 
formula, 392 
ball 
closed, 423 
is a manifold with 
boundary, 21, 40 
coordinate, 4 
open, 423 
band, Mobius, 265 
base 
of a covering, 28 
of a vector bundle, 59 
basis 
dual, 66 
standard, for R”, 66 
for a vector space, 405 
isomorphism, 11, 408 
ordered, 406 
representation of a vector, 
406 
standard, 406 
bilinear, 172 
form, 173 


block upper triangular, 420 
boundary 
induced volume form, 262 
manifold, 20 
manifold with, 19 
push-forward, 53 
tangent space, 53 
topological, 19 
vector field on, 62 
of a manifold with 
boundary, 19, 128 
disjoint from interior, 20 
is a manifold, 21 
is a submanifold, 120 
of a singular simplex, 292 
operator, singular, 292 
singular, 292 
topological, 20 
bounded linear map, 424 
bracket 
commutator, 372 
in a Lie algebra, 371 
Lie, 329 
antisymmetry, 330 
bilinearity, 330 


coordinate expression, 329 


equals Lie derivative, 333 
is smooth, 329 
Jacobi identity, 330 
naturality, 331 
tangent to submanifold, 
331 
Poisson, 348 
on R”, 348 
Bredon, Glen E., vi 
bump function, 35, 39 
existence, 38 
bundle 
cotangent, 69 
is a vector bundle, 69 
isomorphic, 60 
isomorphism, 60, 127 
Mobius, 265 
map, 60, 192 
bijective, 127 


normal, 139 
is a submanifold, 139 

of tensors, 179 

tangent, 57 
is a vector bundle, 59 
projection, 57 
smooth structure, 57 
standard coordinates, 58 
trivial, 62 

tensor, 180 

trivial, 59 
tangent, 62 

vector, 58 
projection is a 

submersion, 95 

restriction of, 59 
section of, 59 


C(M) (space of continuous 
functions), 40 
C* (nonzero complex numbers), 
31 
C* (continuously differentiable), 
425 
C* (k times continuously 
differentiable), 425 
manifold, 9 
structure, 9 
C2(U), 425 
CP” (complex projective space), 
169 
C@ (infinitely differentiable), 7, 
425 
structure, 9 
C%(M), 24 
C%(U), 425 
C™” structure, 9 
calibrated submanifold, 304 
calibration, 304 
canonical 
coordinates, 349 
form 
for commuting vector 
fields, 337 
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for nonvanishing vector 
field, 331 
for symplectic tensor, 221 
symplectic structure on 
T*M, 223 
Cayley numbers, 401 
centered at a point, 4 
chain 
complex, 286 
singular, 293 
group 
singular, 292 
smooth, 296 
singular, 292 
smooth, 296 
chain rule 
for partial derivatives, 431 
for total derivatives, 429 
change of variables, 438 
characterization of homogeneous 
spaces, 162 
chart, 4 
centered at a point, 4 
generalized, 19 
negatively oriented, 232 
oriented, 232 
positively oriented, 232 
slice, 97 
smooth, 7, 10 
on a manifold with 
corners, 252 
with corners, 252 
with corners, 252 
smooth, 252 
smoothly compatible, 252 
circle, 6, 14 
as a Lie group, 31 
de Rham cohomology, 281 
group, 31 
Lie algebra of, 374 
subgroup of C*, 124 
homeomorphic to square, 7 
not diffeomorphic to square, 
127 
class C1, 425 
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class C*, 425 
class C'°, 425 
closed 
1-form, 85 
coordinate independence, 
85 
ball, 423 
is a manifold with 
boundary, 21, 40 
covector field, 85, 214 
coordinate independence, 
85 
vs. exact, 85-87 
curve segment, 82 
form, 219 
local exactness, 279 
rectangle, 433 
subgroup, 124 
is embedded, 394 
subgroup theorem, 392 
submanifold, 126 
closest point to a submanifold, 
144 
cochain 
complex, 286 
homotopy, 274 
map, 286 
codimension, 97 
cofactor, 419 


coframe 
coordinate, 71 
dual, 71 
global, 71 
local, 71 
cohomologous, 273 
cohomology 
class, 273 
de Rham, 272 
of nonorientable manifold, 
281 
of orientable manifold, 
279 
with compact support, 
284 


group, de Rham, 272 


map, induced, 273 
of a complex, 285 
singular, 295 
coisotropic 
immersion, 222 
submanifold, 222 
subspace, 220 
column 
operations, elementary, 417 
rank, 413 
combination, linear, 404 
commutator bracket, 372 
commute, 335 
commuting vector fields, 335, 
336 
canonical form, 337 
compact support, 34 
de Rham cohomology with, 
284 
compactly supported 
forms, 282 
function, 34 
section, 59 
compatible, smoothly, 7 
charts with corners, 252 
complement 
orthogonal, 423 
symplectic, 220 
complementary 
subspace, 407 
complete 
smooth atlas, 9 
vector field, 316 
on a compact manifold, 
317 
completely integrable, 359 
vs. involutive, 359 
complex, 285 
analytic structure, 9 
chain, 286 
cochain, 286 
general linear group, 31 
Lie algebra of, 402 
manifold, 9 
projective space, 169 


short exact sequence of, 286 
singular chain, 293 
special linear group, 150 
component 
covector, transformation 
law, 68, 69, 77 
functions, 424 
of a covector field, 70 
of a tensor field, 180 
of a vector field, 60 
identity, of a Lie group, 167 
map, 26 
of a covector, 66 
of a vector, 50 
vector, transformation law, 
52, 69 
with respect to a basis, 406 
composite function, differential 
of, 74 
composition 
and tangent vectors, 55 
of embeddings, 97 
of immersions, 97 
of smooth maps, 25 
of submersions, 97 
computation in coordinates, 49 
connected 
manifold 
points joined by piecewise 
smooth curves, 79 
points joined by smooth 
curves, 144 
sum, 128 
connecting homomorphism, 286 
conservative, 82 
vector field, 91 
vs. exact, 83 
consistently oriented, 230 
charts, 232 
constant 
function, and zero 
differential, 74 
rank, 94 
and diffeomorphism, 131 
and immersion, 111 


Index 449 


and submersion, 131 
image, 127 
level set theorem, 113 
continuous pointwise orientation, 
232 
continuously differentiable, 425 
contractible, 278 
contraction, 105, 235 
lemma, 105 
contravariant 
functor, 67 
tensor, 179 
tensor field, 180 
vector, 69 
convergence 
of series of functions, 439 
pointwise, 439 
uniformly, 439 
coordinate 
ball, 4 
chart, 4 
centered at a point, 4 
smooth, 7 
coframe, 71 
computations, 49 
domain, 4 
map, 4 
smooth, 10 
neighborhood, 4 
representation 
for a point, 18 
of a function, 24 
of a map, 25 
subspace, 413 
vector, 50 
transformation law, 52, 68 
vector field, 61 
coordinates, 406 
canonical, 349 
Darboux, 349 
local, 4 
slice, 97 
spherical, 104 
standard 
on R”, 11 
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on the tangent bundle, 58 
stereographic, 21 
symplectic, 349 

corner point, 252, 253 
corners 
chart with, 252 

smoothly compatible, 252 
smooth manifold with, 252 
smooth structure with, 252 

correspondence 
between Lie groups and Lie 
algebras, 398 
coset, 407 
cotangent 
bundle, 69 
canonical symplectic 
structure, 223 

is a vector bundle, 69 

symplectic structure, 222 
space, 68 
tangent isomorphism, 192, 

197 
not canonical, 198 
countable 
group, 32 
second 

product, 4 

subspace, 4 
subcover, 4 

counterclockwise, 230 
covariant 
tensor, 172 
tensor field, 180 
transformation law, 196 
vector, 69 
covector, 65, 204 
components, 66 
transformation law, 68, 
69, 77 
field, 70 

closed, 85, 214 

closed vs. exact, 85-87 

conservative, 82 

conservative vs. exact, 83 

exact, 82 


pullback, 76 
restriction, 123 
smooth, 70 
smoothness criteria, 70 
space of, 71 
tangent, 68 
covector field 
exact, on the torus, 91 
integral 
diffeomorphism 
invariance, 78 
in R, 78 
cover, regular open, 36 
covered, evenly, 28 
covering 
base of, 28 
group, 157, 380 
acts properly, 158 
has isomorphic Lie 
algebra, 380 
is a Lie group, 158 
Lie, 380 
universal, 380 
homomorphism, 380 
manifold, smooth structure, 
29 
map, 28 
injective, 28 
topological, 28 
orientation, 265 
smooth, 28 
is a local diffeomorphism, 
28 
is an immersion, 95 
is open, 28 
local sections of, 28 
smooth maps from base, 
29 
vs. topological, 28 
space, 28 
of Lie group, 167 
orientable, 265 
transformation, 157 
Cramer’s Rule, 31 
critical 


point, 113 
value, 114 
cross product, 172 
cube, 433 
symmetry group of, 161 
cup product, 303 
curl, 263 
curvature, 1 
curve 
and the differential, 74 
closed, 82 
in submanifold, 128 
integral, 307 
is immersed, 313 
length of 
in R”, 90 
segment, 79 
closed, 82 
length of, 186 
piecewise smooth, 79 
smooth, 79 
smooth, 54 
is an immersion, 94 
tangent vector to, 54 
cusp, 104 
cutoff function, 35 
cycle, singular, 292 


d (differential of a function), 72 
d (exterior derivative), 215 
6 (Kronecker delta), 66 
67 (Kronecker delta for 
multi-indices), 206 
6-close, 138 
Darboux 
coordinates, 349 
theorem, 349 
de Rham 
basis, 300 
cohomology, 86, 272 
diffeomorphism 
invariance, 273 
functoriality, 273 
homotopy invariance, 276 
induced map, 273 
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is a topological invariant, 
274 
of a simply connected 
manifold, 279 
f disjoint union, 277 
of Euclidean space, 279 
of nonorientable manifold, 
281 
of orientable manifold, 
279 
of spheres, 290 
of the circle, 281 
of zero-manifolds, 278 
top dimensional, 279, 281 
topological invariance, 
277 
with compact support, 
284 
zero-dimensional, 278 
cover, 300 
group, 272 
homomorphism, 298 
manifold, 300 
theorem, 300 
decomposable, 225, 235 
defining 
function, 116 
existence, 115 
local, 116 
map, 116 
and tangent space, 116 
local, 116 
representation 
of GL(n, C), 33 
of GL(n, R), 33 
delta, Kronecker, 66 
dense 
curve on the torus, 95, 126 
as immersed submanifold, 
119 
is not embedded, 126 
subgroup of the torus, 125 
dependent, linearly, 404 
derivation, 43, 45 
derivative 


° 
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directional, 43 
exterior, 214, 215, 217 
is local, 216 
Lie 
of a tensor field, 339 
of a vector field, 328 
of a determinant, 91 
of a map, 51 
partial, 425 
higher order, 425 
order of, 425 
second order, 425 
total, 428 


determinant, 172, 202 


and volume, 225 

convention, 212 

derivative of, 91 

differential of, 92 

is a Lie group 
homomorphism, 33 

is a submersion, 115 

is a tensor, 173 

of a matrix, 415 


diffeomorphic, 26 


is an equivalence relation, 
26 


diffeomorphism 


and constant rank, 131 

between Euclidean spaces, 7 

between manifolds, 26 

group acts transitively, 325 

invariance of de Rham 
cohomology, 273 

local, 26 


differentiable, 428 


continuously, 425 
infinitely, 425 
structure, 9 


differential 


and linear approximation, 
74 
and push-forward, 75 
equation 
and the Frobenius 
theorem, 361 


integrating, 309 

ordinary, 308, 309 
form, 70, 212 

and orientation, 233 

closed, 85 

closed vs. exact, 85-87 

conservative, 82 


conservative vs. exact, 83 


exact, 82 
left-invariant, 375 
Lie derivative, 341 
geometry, v 
ideal, 370 
of a constant, 73 
of a determinant, 92 
of a function, 72 


of a map, 51 
of a product, 73 
of a quotient, 73 
of a sum, 73 
of function composed with 
curve, 74 
zero, 74 
differentiation, exterior, 217 
dimension, 406 
invariance of, 64 
of a tangent distribution, 
356 
direct sum, 407 
directional derivative, 43 
in R”, 43, 431 
of vector field, 327 
discontinuous, properly, 158 
discrete 
group, 32 
action, 149 
proper action, 157 
quotient by, 159 
subgroup, 160 
quotient by, 160 
distance on a Riemannian 
manifold, 188 
distribution, 356 
and smooth sections, 356 


coordinate formula, 72, 73 


determined by a foliation, 
365 
dimension of, 356 
examples, 357 
integrable, 358 
is involutive, 358 
involutive, 358 
spanned by vector fields, 
357 
tangent, 356 
divergence, 262, 344 
and volume decreasing 
flows, 345 
and volume increasing flows, 
345 
and volume preserving 
flows, 345 
on nonoriented manifold, 
266 
product rule, 267 
theorem, 262 
domain 
coordinate, 4 
flow, 313 
of integration, 436 
in a manifold, 245 
in the boundary of a 
manifold with corners, 
254 
regular, 238 
restricting, 121 
Donaldson, Simon, 27 
dot product, 12, 172, 421 
dual 
basis, 66 
standard, for R”, 66 
coframe, 71 
homomorphism, 295 
map, 67 
space, 66 
second, 67 
dV (in integral notation), 434 
dV, (Riemannian volume form), 
258 
dynamical systems, 333 
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e (identity of a Lie group), 30 
E(n) (Euclidean group), 161 
eigenfunction of Laplace 
operator, 267 
eigenvalue of Laplace operator, 
267 
Einselement, 30 
Einstein summation convention, 
12 
element, function, 56 
elementary 
column operations, 417 
k-covector, 206 
matrix, 417 
row operations, 417 
embedded 
subgroup, 124 
submanifold, 97 
closed, 126 
image of embedding, 98, 
118 
local characterization, 97 
of a manifold with 
boundary, 120 
open, 97 
uniqueness of smooth 
structure, 98 
embedding, 94 
composition of, 97 
image is an embedded 
submanifold, 118 
smooth, 94 
theorem, Whitney, 136 
strong, 137 
topological, 94 
equation, variational, 321 
equivalent norms, 423 
equivariant 
map, 149 
rank theorem, 150 
escape lemma, 316 
Euclidean 
dot product, 12 
group, 161 
inner product, 421 
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locally, 3 
metric, 185 
space, 11, 406 
as a Lie group, 31 
as a manifold, 11 
Lie algebra of, 374 
smooth structure, 11 
standard coordinates, 11 
uniqueness of smooth 
structure, 27 
evaluation map, 373 
even permutation, 414 
evenly covered, 28 
exact 
1-form, 82 
covector field, 82 
on the torus, 91 
form, 219 
functor, 296 
sequence, 285 
of complexes, 286 
vs. conservative, 83, 85-87 
existence of Riemannian metric, 
194 
expansion by minors, 419 
exponential map, 385 
and one-parameter 
subgroups, 385 
is a local diffeomorphism, 
385 
of GL(n, R), 385 
of a Lie group, 385 
push-forward, 385 
smoothness, 385 
exponential of a matrix, 383 
extension 
lemma, 39 
of smooth function, 39 
exterior 
derivative, 214, 215, 217 
and Lie brackets, 352 
and pullback, 218 
is local, 216 
differentiation, 217 
product, 209 


F* (pullback) 
of covectors, 75 
of forms, 213 
of functions, 40 
of tensors, 181 
F, (push-forward) 
in coordinates, 50 
induced Lie algebra 
homomorphism, 378 
of vector fields, 63 
of vectors, 46 
F-related, 63 
face map, 292 
faithful representation, 398 
fake R*, 27 
fiber 
of a map, 112 
of a vector bundle, 58 
field 
plane, 356 
figure eight, 95 
as immersed submanifold, 
119 
is not embedded, 126 
finite group 
as a Lie group, 32 
finite, locally, 36 
finite-dimensional 
representation, 33 
vector space, 405 
first-order system of PDEs, 362 
flag, 164 
manifold, 164 
flat, 193 
chart, 359 
for a foliation, 365 
metric, 187, 198 
on torus, 199 
flow, 313 
domain, 313 
fundamental theorem on, 
314 
proof, 315 
generated by a vector field, 
314 


global, 310 
arises from vector field, 
311 
is orientation preserving, 
326 
local, 313 
maximal, 314 
volume decreasing, 345 
and divergence, 345 
volume increasing, 345 
and divergence, 345 
volume preserving, 345 
and divergence, 345 
foliation, 365 
determines involutive 
distribution, 365 
examples, 365 
leaves of, 365 
form 
bilinear, 173 
closed, 85 
vs. exact, 85-87 
conservative, 82 
vs. exact, 83 
differential, 70, 212 
and orientation, 233 
exact, 82 
vs. closed, 85-87 
vs. conservative, 83 
left-invariant, 375 
Lie derivative, 341 
forward reparametrization, 81 
frame 
global 
and trivial bundle, 64 
for a manifold, 62 
left-invariant, 375 
local, 60 
and local trivialization, 64 
for a manifold, 62 
negatively oriented, 232 
oriented, 232 
orthonormal, 188 
adapted, 192, 262 
existence, 188 
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positively oriented, 232 
free 
group action, 147 
vector space, 175 
characteristic property, 
175 
Freedman, Michael, 27 
Frobenius theorem, 359 
and partial differential 
equations, 361 
global, 366 
proof, 367 
function 
element, smooth, 56 
real-valued, 23 
smooth 
coordinate representation, 
24 
extension of, 39 
on a manifold, 24 
vector-valued, 23 
vs. map, 23 
functional, linear, 65 
functor, 67 
exact, 296 
fundamental correspondence 
between Lie groups and 
Lie algebras, 398 
fundamental theorem for line 
integrals, 81 
fundamental theorem of Sophus 
Lie, 398 
fundamental theorem on flows, 
314 
proof, 315 


GL*(n,R), 124 

GL(n, C), see complex general 
linear group 

GL(n,R), see general linear 


group 
GL(V), 31 
G-space, 146 


homogeneous, 161 
left, 146 
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right, 146 
Gauss’s theorem, 262 
general linear group, 13, 31 
complex, 31 
Lie algebra of, 402 
components, 165 
connected, 168 
is a manifold, 13 
Lie algebra of, 376 
natural action, 148 
one-parameter subgroups, 
383 
general position, 291 
generalized chart, 19 
generator, infinitesimal 
of a flow, 313 
of a global flow, 311 
geometric 
simplex, 291 
tangent space, 42 
tangent vector, 42 
geometry 
differential, v 
Riemannian, 187 
germ, 56 
global 
coframe, 71 
flow, 310 
arises from vector field, 
311 
frame 
and trivial bundle, 64 
for a manifold, 62 
Frobenius theorem, 366 
proof, 367 
trivialization, 59 
and global frame, 64 
globally Hamiltonian, 347 
gradient, 71, 193 
orthogonal to level set, 199 
graph 
coordinates 
on spheres, 13 
Riemannian metric in, 


191 


is an embedded 
submanifold, 103 
of a smooth function, 103 
Grassmann manifold, 15, 17, 164 
is compact, 169 
Grassmannian, 15, 17, 164 
is compact, 169 
Green’s 
identities, 267 
theorem, 251 
group 
action, 145 
free, 147 
local one-parameter, 313 
one-parameter, 310 
orientation-preserving, 


265 
proper, 147 
properly discontinuous, 
158 
smooth, 146 
transitive, 147 
circle, 31 


circle, subgroup of C*, 124 
complex general linear, 31 
de Rham, 272 
discrete, 32 
general linear, 31 
injective, 296 
laws, 310 

of a flow, 313 
Lie, 30 
special linear, 115 
symmetric, 414 
topological, 30 


H” (upper half space), 19 
Haar 

integral, 376 

volume form, 376 
half space, upper, 19 
Hamiltonian 

globally, 347 

locally, 347 

vector field, 346, 347 


on R2”, 346 
tangent to level sets, 347 
harmonic function, 267 
Hausdorff 
product, 3 
space, 3 
subspace, 3 
Hermitian, 151 
higher-order partial derivative, 
425 
Hodge star operator, 268 
Hom(V,W) (space of linear 
maps), 196 
homeomorphism 
smooth, 99 
homeomorphism, smooth, 99 
homogeneous 
G-space, 161 
manifold, 161 
space, 161 
characterization theorem, 
162 
homologous submanifolds, 303 
homology 
of a complex, 286 
singular, 292 
smooth, 296 
homomorphism 
covering, 380 
dual, 295 
induced Lie algebra, 379 
Lie, 32 
Lie algebra, 372 
induced, 378 
Lie group, 32 
is equivariant, 149 
homotopic, 142 
maps are smoothly 
homotopic, 143 
path, 255 
relative to a set, 142 
smoothly, 143 
homotopy, 142 
cochain, 274 
equivalence, 274 
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equivalent, 274 
invariance of de Rham 
cohomology, 276 
inverse, 274 
operator, 274 
relative to a set, 142 
Hopf map, 170 
hyperplane, 71 
hypersurface, 235 
in a Riemannian manifold, 
260 
induced volume form, 260 
orientability, 237 


I, (identity matrix), 409 
ideal, differential, 370 
identity 
component, 167 
matrix, 409 
of a Lie group, 30 
image, 407 
of a Lie group 
homomorphism, 169 
of an embedding, 118 
of constant rank map, 127 
immersed submanifold, 119 
of a manifold with 
boundary, 120 
immersion, 94 
and constant rank, 111 
between manifolds with 
boundary, 120 
composition of, 97 
is locally an embedding, 120 
theorem, Whitney, 134 
strong, 137 
vs. embedding, 96 
implicit function theorem, 109 
improper integral, 241 
increasing multi-index, 207 
independent, linearly, 404 
index 
conventions, 12 
for covectors, 67 
lower, 12 
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position, 12 
upper, 12 
induced 
cohomology map, 273 
Lie algebra homomorphism, 
378, 379 
metric, 191 
orientation 
on a boundary, 239 
orientation on a boundary, 
239 
infinite-dimensional vector 
space, 405 
infinitely differentiable, 425 
infinitesimal generator, 311, 313 
invariant under flow, 312, 
313 
of a flow, 313 
of a global flow, 311 
injective 
group, 296 
immersion vs. embedding, 
96 
inner product, 184, 421 
Euclidean, 421 
space, 421 
integrable, 434 
completely, 359 
vs. involutive, 359 
distribution, 358 
is involutive, 358 
over a bounded set, 436 
integral 
curve, 307 
and one-parameter 
subgroup, 382 
is immersed, 313 
improper, 241 
line, 78, 79 
fundamental theorem, 81 
of a vector field, 91 
parameter independence, 
81 
lower, 434 
manifold, 357 


union of, 366 
uniqueness, 360 
of a covector field 
diffeomorphism 
invariance, 78 
in R, 78 
of a differential form 
computing, 245 
diffeomorphism 
invariance, 241, 245 
linearity, 244 
orientation reversal, 244 
over a manifold, 242 
over a smooth chain, 297 
over a smooth simplex, 
297 
positivity, 244 
of a function 
on a rectangle, 434 
on a Riemannian 
manifold, 260 
over a bounded set, 436 
of an n-form in R”, 240 
on a manifold, 243 
on a manifold with corners, 
253 
over a 0-manifold, 244 
over a boundary, 244 
over a submanifold, 244 
upper, 434 
integrating a system of ODEs, 
309 
integration 
by parts, 267 
domain of, 436 
in a manifold, 245 
iterated, 438 
on Lie groups, 375 
interior 
multiplication, 235 
is an antiderivation, 235 
of a manifold with 
boundary, 19, 128 
disjoint from boundary, 
20 


is a manifold, 21 
intersection, transverse, 128 
intertwine, 149 
invariant 

left-, 373 
tensor field, 343 
and Lie derivative, 343 
under a flow, 343 
vs. Lie derivative, 343 
vector field, 312, 335 
inverse 
function theorem, 105 
map, smoothness of, 99 
of a matrix, 410 
inversion map, 30 
invertible 
linear map, 408 
matrix, 409 
involutive, 358 
distribution 


and differential forms, 369 


vs. completely integrable, 
359 

inward-pointing, 238 
isometric, 187 
isometry, 187, 422 
isomorphic 

Lie algebras, 373 

locally, 398 

vector bundles, 60 

vector spaces, 408 
isomorphism, 408 

basis, 408 

bundle, 60 

determined by a basis, 11 

Lie, 32 

Lie algebra, 373 

Lie group, 32 

vector bundle, 60 
isotropic 

immersion, 222 

submanifold, 222 

subspace, 220 
isotropy group, 147 
iterated integration, 438 
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Jacobi identity, 330, 372 
for Poisson bracket, 353 
Jacobian matrix, 51, 425 


k-covector, 204 
elementary, 206 
k-form, 212 
k-slice 
in R”, 97 
in a manifold, 97 
kernel, 407 
of Lie homomorphism 
is a subgroup, 150 
Kervaire, Michel, 27 
Killing field, 344 
Euclidean, 344, 352 
kink, 104 
Kronecker delta, 66 
for multi-indices, 206 


L, (left translation), 32 
Lagrangian 
immersion, 222 
submanifold, 222 
and closed 1-form, 223 
subspace, 220 
Laplace operator, 267 
Laplacian, 267 
leaf of a foliation, 365 
Lebesgue measure, 434 
left 
action, 145 
G-space, 146 
translation, 32, 373 
left-invariant 
differential form, 375 
frame, 375 
tensor, 375 
vector field, 373 
is complete, 381 
length 
of a curve segment, 186 
in R”, 90 
isometry invariant, 187 
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parameter independence, 
186 
of a tangent vector, 186 
of a vector, 421 
level set, 104 
of submersion, 113 
regular, 114 
is a submanifold, 114 
of a real-valued function, 
114 
theorem, 114 
theorem, constant rank, 113 
Lie 
algebra, 371 
abelian, 372 
and one-parameter 
subgroups, 382 
homomorphism, 372, 396 
homomorphism, induced, 
378, 379 
isomorphic, 373 
isomorphism, 373 
of SL(n, R), 388 
of a Lie group, 373 
of a subgroup, 379, 387 
product, 372 
representation, 398 
bracket, 329 
antisymmetry, 330 
bilinearity, 330 
coordinate expression, 329 
equals Lie derivative, 333 
is smooth, 329 
Jacobi identity, 330 
naturality, 331 
tangent to submanifold, 
331 
covering group, 380 
has isomorphic Lie 
algebra, 380 
derivative 
and invariant tensor field, 
343 
equals Lie bracket, 333 
of a tensor field, 339 


of a vector field, 328 
of differential form, 341 
group, 30 
countable, 32 
covering of, 167 
discrete, 32 
finite, 32 
homomorphism, 32, 396 
homomorphism is 
equivariant, 149 
homomorphism, image of, 
169 
homomorphism, with 
discrete kernel, 167 
identity component, 167 
integration on, 375 
is orientable, 375 
is parallelizable, 375 
isomorphism, 32 
product of, 32 
simply connected, 397 
homomorphism, 32 
isomorphism, 32 
subalgebra, 372 
subgroup, 124 
associated with Lie 
subalgebra, 394 
closed, 124 
embedded, 124 
Lie, Sophus, 398 
fundamental theorem, 398 
line integral, 78, 79 
fundamental theorem, 81 
of a vector field, 91 
parameter independence, 81 
line with two origins, 21 
linear 
approximation, 41 
and the differential, 74 
combination, 404 
functional, 65 
group 
complex general, 31 
complex special, 150 
general, 31 


special, 115, 125 
map, 407 
over C™®(M), 90 
system of PDEs, 362 
linearly 
dependent, 404 
independent, 404 
Lipschitz 
constant, 432 
continuous, 432 
local 
coframe, 71 
coordinate map, 4 
coordinate representation 
for a point, 18 
coordinates, 4 
defining function, 116 
existence, 115 
defining map, 116 
diffeomorphism, 26 
exactness of closed forms, 
279 
flow, 313 
frame, 60 
and local trivialization, 64 
for a manifold, 62 
orthonormal, 188 
isometry, 187 
one-parameter group action, 
313 
operator, 216 
parametrization, 191 
section, 28, 59 
existence of, 111 
of smooth covering, 28 
trivialization, 59 
and local frame, 64 
locally 
Euclidean, 3 
finite, 36 
cover, 36 
Hamiltonian, 347 
isomorphic, 398 
Lorentz metric, 2, 195 
lower 
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integral, 434 
sum, 433 
lowering an index, 193 


M(m x n,C) (space of complex 
matrices), 12 
M(m x n,R) (space of matrices), 
12 
M(n,C) (space of square 
complex matrices), 12 
M(n, R) (space of square 
matrices), 12 
manifold 
boundary, 20 
Cho 
complex, 9 
is metrizable, 191 
is paracompact, 37 
oriented, 232 
real-analytic, 9 
Riemannian, 184 
smooth, 1, 9 
is a manifold with 
boundary, 20 
with corners, 252 
topological, 1, 3 
with boundary 
boundary point, 128 
interior point, 128 
product of, 269 
push-forward, 53 
smooth atlas, 20 
smooth structure, 20 
submanifold of, 120 
tangent space, 53 
topological, 19 
vector field on, 62 
with corners, 252 
product of, 269 
Stokes’s theorem, 254 
map vs. function, 23 
map, smooth, between 
manifolds, 24 
mapping vs. function, 23 
matrices 
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of fixed rank, 116 
of maximal rank, 13 
matrix 
exponential, 383 
Lie algebra, 372 
of a linear map, 409 
skew-symmetric, 412 
symmetric, 115, 412 
upper triangular, 420 
maximal 
rank, matrices of, 13 
smooth atlas, 9 
maximal flow, 314 
Mayer-—Vietoris sequence, 288 
connecting homomorphism, 
289 
Mayer-Vietoris theorem 
for de Rham cohomology, 
288 
for singular cohomology, 295 
for singular homology, 294 
measure zero, 254 
in R”, 130, 434 
and smooth maps, 130, 
131 
in manifolds, 131 
and smooth maps, 132 
n-dimensional, 435 
submanifolds, 132 
metric, 184 
associated to a norm, 423 
Euclidean, 185 
flat, 187, 198 
induced, 191 
pseudo-Riemannian, 195 
Riemannian, 184 
existence, 194 
in graph coordinates, 191 
round, 191, 197 
in stereographic 
coordinates, 197 
space, 184 
metrizable, 191 
Milnor, John, 27 
minor of a matrix, 413 


mixed 
tensor, 179 
tensor field, 180 
Mobius 
band, 265 
bundle, 169, 265 
transformation, 162 
module, 404 
Moise, Edwin, 27 
Morse theory, 325 
Moser, Jurgen, 351 
multi-index, 206 
increasing, 207 
multilinear, 172, 415 
map 
and tensor product, 178 
over C'™(M), 197 
multiplication 
map, 30 
scalar, 403 
Munkres, James, 27 


n-dimensional measure zero, 435 
n-sphere, 5 
n-torus, 14 

as a Lie group, 32 
natural 

action of GL(n, R), 148 

action of O(n), 148 
naturality of the Lie bracket, 331 
negatively 

oriented, 231 

chart, 232 
frame, 232 
neighborhood, coordinate, 4 
nonautonomous system of 
ODEs, 326 

nondegenerate 2-tensor, 195, 219 
nonorientable manifold, 232 

de Rham cohomology, 281 
nonsingular matrix, 409 
norm, 423 

associated metric, 423 

equivalent, 423 

of a tangent vector, 186 


topology, 423 
normal 
bundle, 139, 199 
is a submanifold, 139 
is a vector bundle, 144 
orthonormal frame, 144 
outward-pointing, 261 
space, 139 
vector, 199 
vector field, 260 
normed linear space, 423 
north pole, 21 
nullity, 411 


O(n), see orthogonal group 
O(t*), 389 
odd 
permutation, 414 
ODE, see ordinary differential 
equation 
one-parameter group action, 310 
local, 313 
one-parameter subgroup, 381 
and integral curve, 382 
and Lie algebra, 382 
generated by X, 383 
of GL(n, R), 383 
of Lie subgroup, 384 
open 
ball, 423 
cover, regular, 36 
rectangle, 433 
submanifold, 12, 13 
is embedded, 97 
orientable, 234 
tangent space, 47 
orbit, 146 
is an immersed submanifold, 
167 
relation, 153 
space, 152 
order of a partial derivative, 425 
ordered basis, 406 
ordinary differential equation, 
308, 309 
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autonomous, 326 
continuity theorem, 320 
existence theorem, 314, 318 
integrating, 309 
nonautonomous, 326 
smoothness theorem, 314, 
320 
uniqueness theorem, 314, 
317 
orientability 
of hypersurfaces, 237 
of parallelizable manifolds, 
234 
orientable, 232 
Lie group, 375 
manifold, de Rham 
cohomology, 279 
open submanifold, 234 
orientation, 230 
and alternating tensors, 231 
and nonvanishing n-form, 
233 
covering, 265 
form, 233 
induced on a boundary, 239 
of a manifold, 232 
0-dimensional, 232, 233 
of a vector space, 231 
0-dimensional, 231 
pointwise, 232 
continuous, 232 
standard, of R”, 231 
Stokes, 239 
orientation-preserving, 234 
group action, 265 
orientation-reversing, 234 
oriented 
basis, 231 
chart, 232 
negatively, 232 
positively, 232 
consistently, 230 
form, 233 
frame, 232 
negatively, 232 
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positively, 232 
manifold, 232 
n-covector, 232 
negatively, 231 
vector space, 231 

orthogonal, 186, 421 
complement, 423 
group, 114, 125 

action on S"—!, 148 

components, 165 

is a Lie subgroup, 125 

is an embedded 

submanifold, 114 

Lie algebra of, 380 

natural action, 148 

special, 125 

special, is connected, 165 
matrix, 114 
projection, 423 

orthonormal 
basis, 422 
frame, 188 
adapted, 192, 262 
existence, 188 
outward-pointing, 238 
unit normal, 261 
vector field on boundary, 
238 
overdetermined, 362 


P” (real projective space), 6 
paracompact, 36 
manifolds are, 37 
parallelizable, 62, 375 
implies orientable, 234 
Lie group, 375 
spheres, 64 
torus, 64 
parametrization, 105 
local, 191 
partial 
derivative 
higher order, 425 
order of, 425 
second order, 425 


partial derivative, 425 
partial differential equations 
and the Frobenius theorem, 
361 
partition 
of a rectangle, 433 
of an interval, 433 
of unity, 37 
existence, 37 
passing to quotient, 112 
path homotopic, 255 
PDE (partial differential 
equation), 361 
period of a differential form, 303 
periodic curve, 325 
permutation, 183, 204, 206 
even, 414 
odd, 414 
piecewise smooth curve segment, 
79 
plane field, 356 
Poincaré lemma, 278 
for compactly supported 
forms, 282 
pointwise 
convergence, 439 
orientation, 232 
continuous, 232 
Poisson 
bracket, 348 
Jacobi identity, 353 
on R”, 348 
commute, 348 
polar coordinate map, 19 
pole 
north, 21 
south, 21 
positions of indices, 12 
positively oriented, 231 
chart, 232 
form, 233 
frame, 232 
n-form, 232 
potential, 82 
computing, 88 


precompact, 36 
product 
inner, 184, 421 
Lie algebra, 372 
manifold 
embedding into, 94 
projection is a 
submersion, 94 
smooth map into, 26 
smooth structure, 14 
tangent space, 64 
of Hausdorff spaces, 3 
f Lie groups, 32 
Lie algebra of, 375 
of manifolds with boundary, 
269 
of manifolds with corners, 
269 
of second countable spaces, 
4 
rule, 43 
semidirect, 169 
smooth manifold structure, 
14 
symmetric, 183 
associativity, 197 
projection 
of a vector bundle, 59 
of product manifold 
is a submersion, 94 
of the tangent bundle, 57 
onto a subspace, 408 
orthogonal, 423 
projective space 
complex, 169 
covering of, 167 
is a manifold, 6 
is Hausdorff, 6 
is second countable, 6 
orientability, 268 
quotient map is smooth, 25, 
26 
real, 6, 14 
smooth structure, 14 
proper 


° 
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action, 147 
of a discrete group, 157 
inclusion map 
vs. embedding, 126 
map, 96 
properly discontinuous, 158 
pseudo-Riemannian metric, 195 
pullback, 75 
of a 1-form, 76 
in coordinates, 77 
of a covector field, 76 
in coordinates, 77 
of a tensor field, 181 
in coordinates, 182 
of exterior derivative, 218 
of form, 213 
in coordinates, 213 
push-forward, 46 
and the differential, 75 
between manifolds with 
boundary, 53 
in coordinates, 50 
of a vector field, 62 
by a diffeomorphism, 63 
of tangent vector to curve, 
55 
smoothness of, 58 


QR. decomposition, 166 
quaternions, 401 
quotient 
by discrete group action, 
159 
by discrete subgroup, 160 
manifold theorem, 153 
of a vector space, 407 
of Lie group by closed 
normal subgroup, 167 
passing to, 112 
uniqueness of, 113 


R* (nonzero real numbers), 31 
R! 
fake, 27 
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nonuniqueness of smooth 
structure, 27 
Rg (right translation), 32 
R”, see Euclidean space 
RP” (real projective space), 6 
raising an index, 193 
range, restricting, 121 
embedded case, 122 
rank 
column, 413 
constant, 94 
level set theorem, 113 
matrices of maximal, 13 
of a linear map, 94, 411 
of a matrix, 412 
of a smooth map, 94 
of a tensor, 173 
of a vector bundle, 58 
row, 413 
theorem, 107 
equivariant, 150 
invariant version, 109 
rank-nullity law, 105 
real projective space, 6, 14 
is a manifold, 6 
is Hausdorff, 6 
is second countable, 6 
smooth structure, 14 
real vector space, 403 
real-analytic 
manifold, 9 
structure, 9 
real-valued function, 23 
rectangle, 433 
as a manifold with corners, 
252 
closed, 433 
open, 433 
refinement, 36 
regular, 37 
regular 
domain, 238 
level set, 114 
is a submanifold, 114 


of a real-valued function, 
114 
theorem, 114 
open cover, 36 
point, 113 
for a vector field, 331 
refinement, 37 
submanifold, 97 
value, 114 
related, see F-related 
relative homotopy, 142 
reparametrization, 81, 186 
backward, 81 
forward, 81 
of piecewise smooth curve, 
186 
representation 
defining 
of GL(n, C), 33 
of GL(n, R), 33 
faithful, 398 
finite-dimensional, 33 
of a Lie algebra, 398 
of a Lie group, 33 
restricting 
the domain of a map, 121 
the range of a map, 121 
embedded case, 122 
into a leaf of a foliation, 
361 
restriction 
of a covector field, 123 
of a vector bundle, 59 
of a vector field, 122 
retraction, 141 
Rham, de, see de Rham 
Riemann 
integrable, 434 
integral, 434 
Riemannian 
distance, 188 
geometry, 187 
manifold, 184 
as metric space, 189 
metric, 184 


existence, 194 
in graph coordinates, 191 
submanifold, 191 
volume element, 258 
volume form, 258 
in coordinates, 258 
right, 230 
action, 146 
is free, 160 
is proper, 160 
is smooth, 160 
G-space, 146 
translation, 32 
right-handed basis, 230 
round metric, 191, 197 
in stereographic 
coordinates, 197 
row operations, elementary, 417 
row rank, 413 


S!, see circle 
S (symmetric group), 204, 206 
SL(n, C) (complex special linear 
group), 150 
SL(n, R), see special linear group 
S”, see sphere 
S(n, R) (symmetric matrices), 
115 
SO(3) is diffeomorphic to P?, 168 
SO(n) (special orthogonal 
group), 125 
Sp(2n,R) (symplectic group), 
227 
SU(2) is diffeomorphic to S?, 167 
SU(n) (special unitary group), 
150 
Sard’s theorem, 132 
Sard, Arthur, 132 
scalar, 404 
multiplication, 403 
second countable, 3 
product, 4 
subspace, 4 
second-order partial derivative, 
425 
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section 
local, 28 
existence of, 111 
of a vector bundle, 59 
of smooth covering, 28 
of a map, 28 
of a vector bundle, 59 
smooth, 59 
zero, 59 
segment, curve, 79 
piecewise smooth, 79 
smooth, 79 
semidirect product, 169 
series of functions, convergence, 
439 
sgn (sign of a permutation), 204 
sharp, 193 
short exact sequence of 
complexes, 286 
sign of a permutation, 204 
signature of a bilinear form, 195 
simplex 
affine singular, 292 
geometric, 291 
singular, 292 
boundary of, 292 
smooth, 296 
standard, 291 
simply connected 
Lie group, 397 
manifold, cohomology of, 
279 
simply connected manifold, 279 
singular 
boundary, 292 
boundary operator, 292 
chain, 292 
complex, 293 
group, 292 
cohomology, 295 
cycle, 292 
homology, 292 
group, 292 
isomorphic to smooth 
singular, 296 
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smooth, 296 
matrix, 409 
point for a vector field, 331 
simplex, 292 
affine, 292 
boundary of, 292 
skew-symmetric matrix, 412 
slice 
chart, 97 
coordinates, 97 
in R”, 97 
in a manifold, 97 
smooth, 7 
atlas, 8 
complete, 9 
maximal, 9 
on a manifold with 
boundary, 20 
chain, 296 
chain group, 296 
chart, 7, 10 
on a manifold with 
corners, 252 
with corners, 252 
coordinate map, 10 
covector field, 70 
covering map, 28 
injective, 28 
is a local diffeomorphism, 
28 
is open, 28 
vs. topological, 28 
curve, 54 
dynamical systems, 333 
embedding, 94 
function 
coordinate representation, 
24 
extension of, 39 
on a manifold, 24 
function element, 56 
group action, 146 
homeomorphism, 99 
manifold, 1, 9 


is a manifold with 
boundary, 20 
structure, 9 
map 
between Euclidean 
spaces, 7, 425 
between manifolds, 24 
composition of, 25 
coordinate representation, 
25 
from a subset of R”, 428 
from base of covering, 29 
into a product manifold, 
26 
is a local property, 24, 25 
section, 59 
simplex, 296 
singular homology, 296 
isomorphic to singular, 


296 
structure, 9 
on R”, 11 


on a manifold with 
boundary, 20 
on a vector space, 11 
on spheres, 27 
on the tangent bundle, 57 
uniqueness, 26 
uniqueness, on R, 326 
uniqueness, on R”, 27 
with corners, 252 
triangulation, 303 
vector field, 60 
smoothly compatible charts, 7 
with corners, 252 
smoothly homotopic, 143 
smoothness is local, 24, 25 
smoothness of inverse maps, 99 
south pole, 21 
space, vector, 403 
over an arbitrary field, 404 
real, 403 
span, 404 
special 
linear group, 115, 125 


complex, 150 
connected, 168 
Lie algebra of, 388, 402 
orthogonal group, 125 
is connected, 165 
unitary group, 150 
is connected, 165 
Lie algebra of, 402 
sphere, 5 
de Rham cohomology, 290 
different smooth structures 
on, 27 
is an embedded 
submanifold, 103, 114 
is orientable, 237, 239, 265 
parallelizable, 64 
standard smooth structure, 
13 
stereographic coordinates, 
21 
vector fields on, 64 
spherical coordinates, 104 
square 
homeomorphic to circle, 7 
not diffeomorphic to circle, 
127 
standard 
basis, 406 
coordinates 
on R", 11 
on the tangent bundle, 58 
dual basis for R”, 66 
orientation of R”, 231 
simplex, 291 
smooth structure 
on R”, 11 
on S”, 13 
on a vector space, 11 
symplectic form, 222 
symplectic structure, 222 
star operator, Hodge, 268 
star-shaped, 87, 278 
starting point of an integral 
curve, 308 
stereographic 
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coordinates, 21 
round metric in, 197 
projection, 21 
Stokes orientation, 239 
Stokes’s theorem, 248 
for surface integrals, 264 
on manifolds with corners, 
254 
subalgebra, Lie, 372 
subbundle, 356 
tangent, 356 
subcover, countable, 4 
subgroup 
closed 
is embedded, 394 
of a Lie group, 392 
dense, of the torus, 125 
discrete, 160 
quotient by, 160 
Lie, 124 
one-parameter, 381 
subinterval, 433 
submanifold, 120 
calibrated, 304 
closest point, 144 
embedded, 97 
curve in, 128 
local characterization, 97 
open, 97 
restricting a map to, 122 
uniqueness of smooth 
structure, 98 
has measure zero, 132 
immersed, 119 
of a manifold with 
boundary, 120 
open, 12, 13 
is embedded, 97 
tangent space, 47 
regular, 97 
restricting a map to, 121 
embedded case, 122 
Riemannian, 191 
tangent space, 101, 102 
and defining maps, 116 
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transverse, 128 
submersion, 94 

and constant rank, 131 

composition of, 97 

into R*, 127 

is a quotient map, 111 

is open, 111 

level set of, 113 

passing to quotient, 112 

theorem, 113 
subordinate to a cover, 37 
subrectangle, 433 
subspace 

affine, 407 

coordinate, 413 

of a vector space, 404 

of Hausdorff space, 3 

of second countable space, 4 

projection onto, 408 
sum 

connected, 128 

direct, 407 

lower, 433 

upper, 433 
summation convention, 12 
support 

compact, 34 

of a function, 34 

of a section, 59 
supported in a set, 34 
surface integral, 264 

Stokes’s theorem for, 264 
Sym (symmetrization), 182, 205 
symmetric 

group, 183, 204, 206, 414 

matrix, 115, 412 

product, 183 

associativity, 197 
tensor, 182 
field, 184 

symmetrization, 182, 205 
symplectic 

basis, 221 

complement, 220 

coordinates, 349 


form, 221, 346 
standard, 222 
geometry, 222 
group, 227 
immersion, 222 
manifold, 221, 346 
orientable, 268 
structure, 221 
canonical, on T*M, 223 
on cotangent bundle, 222 
standard, 222 
submanifold, 222 
subspace, 220 
tensor, 219 
canonical form, 221 
vector field, 347 
vector space, 219 
symplectomorphism, 222 


J*(M) (space of covector fields), 
71 
2\ see torus 
J(M) (space of vector fields), 62 
, see torus 
(n-torus), 32 
tangent 
bundle, 57 
is a vector bundle, 59 
projection, 57 
smooth structure, 57 
standard coordinates, 58 
trivial, 62 
cotangent isomorphism, 192, 
197 
not canonical, 198 
covector, 68 
distribution, 356 
and smooth sections, 356 
determined by a foliation, 
365 
examples, 357 
spanned by vector fields, 
357 
space 
alternative definitions, 55 


ma 


nn, 
wa 


geometric, 42 
to a manifold, 45 
to a manifold with 
boundary, 53 
to a submanifold, 101, 
102, 116 
to a vector space, 48 
to an open submanifold, 
47 
to product manifold, 64 
subbundle, 356 
to a submanifold, 122 
vector 
geometric, 42 
in Euclidean space, 42 
local nature, 47 
on a manifold, 45 
to a curve, 54 
to composite curve, 55 
to curve, push-forward of, 
55 


tautologous 1-form, 222 
Taylor’s formula, 45 
tensor 


alternating, 202 
and orientation, 231 
elementary, 206 
bundle, 179, 180 
contravariant, 179 
covariant, 172 
field, 180 
contravariant, 180 
covariant, 180 
invariant under a flow, 
343 
Lie derivative of, 339 
mixed, 180 
smooth, 180 
symmetric, 184 
transformation law, 196 
left-invariant, 375 
mixed, 179 
product 
and multilinear maps, 178 
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characteristic property, 
176 
of tensors, 173 
of vector spaces, 176 
of vectors, 176 
uniqueness, 196 
symmetric, 182 
time-dependent vector field, 326 
top-dimensional cohomology 
of nonorientable manifold, 
281 
of orientable manifold, 279 
topological 
boundary, 20 
covering map, 28 
vs. smooth, 28 
embedding, 94 
group, 30 
invariance of de Rham 
cohomology, 277 
manifold, 1, 3 
with boundary, 19 
topology, norm, 423 
torus, 14 
as a Lie group, 32 
dense curve on, 95, 126 
as immersed submanifold, 
119 
is not embedded, 126 
dense subgroup of, 125 
exact covector fields on, 91 
flat metric on, 199 
Lie algebra of, 375 
of revolution, 191, 266 
parallelizable, 64 
smooth structure on, 14 
total derivative, 428 
total space, 59 
trace, 168 
transformation 
of coordinate vectors, 52, 68 
of covector components, 68, 
69, 77 
of vector components, 52, 69 
transition 
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map, 7 

matrix, 230, 410 
transitive group action, 147 
translation 

left, 32 

lemma, 309 

right, 32 
transpose 

of a linear map, 67 

of a matrix, 412 
transposition, 414 
transverse 

intersection, 128 

submanifolds, 128 

vector, 237 

vector field, 237 
triangular matrix, upper, 420 
triangulation, 303 

smooth, 303 
trivial bundle, 59 

and global frame, 64 

tangent, 62 
trivialization 

global, 59 

local, 59 

and local frame, 64 

tubular neighborhood, 140 

theorem, 140 


U(n) (unitary group), 150 
uniform convergence, 439 
uniqueness 
of quotients, 113 
of smooth structure, 26 
on embedded 
submanifold, 98 
unit 
circle, 14 
element of a Lie group, 30 
sphere, 5 
vector, 421 
unitary group, 150 
diffeomorphic to 
S! x SU(n), 167 
is connected, 165 


Lie algebra of, 402 

special, 150 

special, is connected, 165 

special, Lie algebra of, 402 
unity, partition of, 37 

existence, 37 
universal covering group, 380 
upper 

half space, 19 

integral, 434 

sum, 433 

triangular matrix, 420 


V* (dual space), 66 
V** (second dual space), 67 
isomorphic to V, 67 
vanishes 
along a submanifold, 123 
at points of a submanifold, 
123 
variational equation, 321 
vector, 404 
addition, 403 
bundle, 58 
isomorphic, 60 
isomorphism, 60 
projection is a 
submersion, 95 
restriction of, 59 
section of, 59 
components, 50 
transformation law, 52, 69 
contravariant, 69 
coordinate, 50 
covariant, 69 
field, 60 
along a submanifold, 199, 
236 
canonical form, 331 
commuting, 335-337 
complete, 316, 317 
component functions of, 
60 
conservative, 91 
coordinate, 61 


directional derivative of, 
327 
globally Hamiltonian, 347 
Hamiltonian, 347 
invariant under a flow, 
335 
invariant under a map, 
312 
invariant under its own 
flow, 312, 313 
Lie algebra of, 372 
line integral of, 91 
locally Hamiltonian, 347 
on a manifold with 
boundary, 62 
push-forward, 62, 63 
restriction, 122 
smooth, 60 
smoothness criteria, 60 
space of, 62 
symplectic, 347 
time-dependent, 326 
transverse, 237 
geometric tangent, 42 
space, 403 
finite-dimensional, 405 
infinite-dimensional, 405 
oriented, 231 
over an arbitrary field, 
404 
real, 403 
smooth structure on, 11 
tangent space to, 48 
tangent 
local nature, 47 
on a manifold, 45 
to composite curve, 55 
transverse, 237 
vector-valued function, 23 
vertex of a simplex, 291 
volume, 1, 437 
and determinant, 225 
decreasing flow, 345 
and divergence, 345 
element, Riemannian, 258 
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form, Riemannian, 258 
in coordinates, 258 
on a boundary, 262 
on a hypersurface, 260 
increasing flow, 345 
and divergence, 345 
measurement, 201 
of a rectangle, 433 
of a Riemannian manifold, 
260 
preserving flow, 345 
and divergence, 345 


wedge product, 209 
Alt convention, 212 
anticommutativity, 210 
associativity, 210 
bilinearity, 210 
determinant convention, 212 
uniqueness, 211 
Weinstein, Alan, 351 
Whitney 
approximation theorem, 138 
on manifolds, 142 
embedding theorem, 136 
strong, 137 
immersion theorem, 134 
strong, 137 
Whitney, Hassler, 137 


X? (X flat), 193 
é# (€ sharp), 193 


zero section, 59 
zigzag lemma, 286 
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